Pattern
Pattern is a web mining module for Python. It has tools for Data Mining, Natural Language Processing, Network Analysis and Machine Learning. It supports vector space model, clustering, classification using KNN, SVM, Perceptron.
- Data Mining: web services (Google, Twitter, Wikipedia), web crawler, HTML DOM parser
- Natural Language Processing: part-of-speech taggers, n-gram search, sentiment analysis, WordNet
- Machine Learning: vector space model, clustering, classification (KNN, SVM, Perceptron)
- Network Analysis: graph centrality and visualization.
It is well documented and bundled with 50+ examples and 350+ unit tests. The source code is licensed under BSD and available from http://www.clips.ua.ac.be/pages/pattern.
Information
- Website: http://www.clips.ua.ac.be/pages/pattern
- GitHub: https://github.com/clips/pattern
- Documentation: http://www.clips.ua.ac.be/pages/pattern-dev
- Community: http://groups.google.com/group/pattern-for-python
Comments
comments powered by Disqus