Metadata-Version: 2.1
Name: PyStemmer
Version: 1.3.0
Summary: Snowball stemming algorithms, for information retrieval
Home-page: http://snowball.tartarus.org/
Author: Richard Boulton
Author-email: richard@tartarus.org
Maintainer: Richard Boulton
Maintainer-email: richard@tartarus.org
License: ['MIT', 'BSD']
Download-URL: http://snowball.tartarus.org/wrappers/PyStemmer-1.3.0.tar.gz
Keywords: python,information retrieval,language processing,morphological analysis,stemming algorithms,stemmers
Platform: any
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: License :: OSI Approved :: BSD License
Classifier: Natural Language :: Danish
Classifier: Natural Language :: Dutch
Classifier: Natural Language :: English
Classifier: Natural Language :: Finnish
Classifier: Natural Language :: French
Classifier: Natural Language :: German
Classifier: Natural Language :: Italian
Classifier: Natural Language :: Norwegian
Classifier: Natural Language :: Portuguese
Classifier: Natural Language :: Russian
Classifier: Natural Language :: Spanish
Classifier: Natural Language :: Swedish
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: C
Classifier: Programming Language :: Other
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 2
Classifier: Programming Language :: Python :: 2.6
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.2
Classifier: Programming Language :: Python :: 3.3
Classifier: Topic :: Database
Classifier: Topic :: Internet :: WWW/HTTP :: Indexing/Search
Classifier: Topic :: Text Processing :: Indexing
Classifier: Topic :: Text Processing :: Linguistic

Stemming algorithms

PyStemmer provides access to efficient algorithms for calculating a
"stemmed" form of a word. This is a form with most of the common
morphological endings removed, ideally representing a common
linguistic base form. Stemming is most useful in building search engines
and information retrieval software: for example, a search with stemming
enabled should be able to find a document containing "cycling" given the
query "cycles".
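
The "cycling" / "cycles" case above can be sketched as a tiny inverted
index keyed on stems. This is only an illustration, not part of PyStemmer:
build_index, search and the sample documents are hypothetical names
introduced here, and the fallback suffix stripper exists only so the sketch
runs without PyStemmer installed (it is not a Snowball algorithm).

```python
from collections import defaultdict

try:
    import Stemmer  # module installed by PyStemmer

    _stem = Stemmer.Stemmer("english").stemWord
except ImportError:
    # Assumption: crude stand-in used only when PyStemmer is missing.
    # It is NOT a Snowball algorithm.
    def _stem(word):
        for suffix in ("ing", "es", "s"):
            if word.endswith(suffix) and len(word) > len(suffix) + 2:
                return word[: -len(suffix)]
        return word


def build_index(docs):
    """Map each stemmed term to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in text.lower().split():
            index[_stem(token)].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents that contain every stemmed query term."""
    terms = [_stem(term) for term in query.lower().split()]
    if not terms:
        return set()
    hits = [index.get(term, set()) for term in terms]
    return set.intersection(*hits)


# Hypothetical sample documents.
docs = {
    1: "cycling in the rain",
    2: "a guide to baking bread",
}
index = build_index(docs)
print(search(index, "cycles"))  # {1}
```

Because both the indexed tokens and the query terms pass through the same
stemmer, "cycles" and "cycling" meet at a shared stem and document 1 is
returned even though it never contains the literal query word.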

PyStemmer provides algorithms for several (mainly European) languages
by wrapping the libstemmer library from the Snowball project in a Python
module.
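
A minimal sketch of selecting algorithms by language name, assuming
PyStemmer is installed and importable as Stemmer. SAMPLES and stem_samples
are hypothetical names introduced here, and the sample words are arbitrary;
the sketch returns an empty result when the module is unavailable.

```python
try:
    import Stemmer  # module installed by PyStemmer
except ImportError:
    Stemmer = None  # lets the sketch run (with empty results) without it

# Hypothetical sample words, keyed by algorithm name.
SAMPLES = {
    "english": ["cycling", "cycles"],
    "french": ["chanteuses", "chanteur"],
    "german": ["katzen", "katze"],
}


def stem_samples(samples):
    """Stem each word list with the algorithm named by its key."""
    if Stemmer is None:
        return {}
    available = set(Stemmer.algorithms())
    return {
        language: Stemmer.Stemmer(language).stemWords(words)
        for language, words in samples.items()
        if language in available  # skip names this build does not provide
    }


for language, stems in sorted(stem_samples(SAMPLES).items()):
    print(language, stems)
```

Checking each name against Stemmer.algorithms() first avoids requesting an
algorithm that the installed libstemmer build does not include.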

It also provides access to the classic Porter stemming algorithm for
English: although this has been superseded by an improved algorithm, the
original algorithm may be of interest to information retrieval
researchers wishing to reproduce results of earlier experiments.
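
To see where the original Porter algorithm and the improved English
algorithm disagree, something like the following sketch may help. It
assumes PyStemmer is installed and that both "porter" and "english" appear
in Stemmer.algorithms(); porter_vs_english and the word list are
hypothetical, chosen only for illustration.

```python
try:
    import Stemmer  # module installed by PyStemmer
except ImportError:
    Stemmer = None  # lets the sketch run (with empty results) without it


def porter_vs_english(words):
    """Return (word, porter_stem, english_stem) where the two stems differ."""
    if Stemmer is None:
        return []
    if not {"porter", "english"} <= set(Stemmer.algorithms()):
        return []
    porter = Stemmer.Stemmer("porter")
    english = Stemmer.Stemmer("english")
    rows = zip(words, porter.stemWords(words), english.stemWords(words))
    return [row for row in rows if row[1] != row[2]]


# Hypothetical word list; no particular output is implied.
for word, old, new in porter_vs_english(["generously", "cycling", "fairly", "news"]):
    print(word, old, new)
```

When reproducing an earlier experiment, running the same vocabulary through
both algorithms in this way shows which terms would be indexed differently.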