python统计文本文件内单词数量的方法

649次阅读  |  发布于5年以前

本文实例讲述了python统计文本文件内单词数量的方法。分享给大家供大家参考。具体实现方法如下:


    # count lines, sentences, and words of a text file
    # set all the counters to zero
    lines, blanklines, sentences, words = 0, 0, 0, 0
    print '-' * 50
    try:
     # use a text file you have, or google for this one ...
     filename = 'GettysburgAddress.txt'
     textf = open(filename, 'r')
    except IOError:
     print 'Cannot open file %s for reading' % filename
     import sys
     sys.exit(0)
    # reads one line at a time
    for line in textf:
     print line,  # test
     lines += 1
     if line.startswith('\n'):
      blanklines += 1
     else:
      # assume that each sentence ends with . or ! or ?
      # so simply count these characters
      sentences += line.count('.') + line.count('!') + line.count('?')
      # create a list of words
      # use None to split at any whitespace regardless of length
      # so for instance double space counts as one space
      tempwords = line.split(None)
      print tempwords # test
      # word total count
      words += len(tempwords)
    textf.close()
    print '-' * 50
    print "Lines   : ", lines
    print "Blank lines: ", blanklines
    print "Sentences : ", sentences
    print "Words   : ", words
    # optional console wait for keypress
    from msvcrt import getch
    getch()

希望本文所述对大家的python程序设计有所帮助。

Copyright© 2013-2020

All Rights Reserved 京ICP备2023019179号-8