我正在使用此代码来计算文本文件中的相同单词。
filename = input("Enter name of input file: ")
file = open(filename, "r", encoding="utf8")
wordCounter = {}
with open(filename,'r',encoding="utf8") as fh:
for line in fh:
# Replacing punctuation characters. Making the string to lower.
# The split will spit the line into a list.
word_list = line.replace(',','').replace('\'','').replace('.','').replace("'",'').replace('"','').replace('"','').replace('#','').replace('!','').replace('^','').replace('$','').replace('+','').replace('%','').replace('&','').replace('/','').replace('{','').replace('}','').replace('[','').replace(']','').replace('(','').replace(')','').replace('=','').replace('*','').replace('?','').lower().split()
for word in word_list:
# Adding the word into the wordCounter dictionary.
if word not in wordCounter:
wordCounter[word] = 1
else:
# if the word is already in the dictionary update its count.
wordCounter[word] = wordCounter[word] + 1
print('{:15}{:3}'.format('Word','Count'))
print('-' * 18)
# printing the words and its occurrence.
for word,occurance in wordCounter.items():
print(word,occurance)
我需要它们按从大到小的顺序排列作为输出。例如:
单词 1:25
单词 2:12
单词 3:5 。 。 .
我还需要将输入获取为“.txt”文件。如果用户写入任何不同的内容,程序必须收到错误“写入有效的文件名”。
如何对输出进行排序并同时生成错误代码?
最佳答案
要按顺序打印,您可以在打印之前按出现次数对它们进行排序,如下所示:
for word,occurance in sorted(wordCounter.items(), key=lambda x: x[1], reverse=True):
print(word,occurance)
为了按照您想要的方式检查文件是否有效,您可以考虑使用:
import os
path1 = "path/to/file1.txt"
path2 = "path/to/file2.png"
if not path1.lower().endswith('.txt'):
print("Write a valid file name")
if not os.path.exists(path1):
print("File does not exists!")
关于python - 如何使用Python对字数进行排序?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/68445404/