Text Analytics with Python — A Practical Real-World Approach to Gaining Actionable Insights from your Data
Text analytics can be a bit overwhelming and frustrating at times with the unstructured and noisy nature of textual data and the vast amount of information available. “Text Analytics with Python” published by Apress\Springer, is a book packed with 385 pages of useful information based on techniques, algorithms, experiences and various lessons learnt over time in analyzing text data.
About the book
Derive useful insights from your data using Python. Learn the techniques related to natural language processing and text analytics, and gain the skills to know which technique is best suited to solve a particular problem.
Text Analytics with Python teaches you both basic and advanced concepts, including text and language syntax, structure, semantics. You will focus on algorithms and techniques, such as text classification, clustering, topic modeling, and text summarization.
A structured and comprehensive approach is followed in this book so that readers with little or no experience do not find themselves overwhelmed. You will start with the basics of natural language and Python and move on to advanced analytical and machine learning concepts. You will look at each technique and algorithm with both a bird’s eye view to understand how it can be used as well as with a microscopic view to understand the mathematical concepts and to implement them to solve your own problems.
Entire codebase with hands on examples are open-sourced and available for everyone on GitHub. Current examples are in Python 2.x but efforts are underway to port the code to Python 3.x for everyone’s benefit.
This book:
- Provides complete coverage of the major concepts and techniques of natural language processing (NLP) and text analytics
- Includes practical real-world examples of techniques for implementation, such as building a text classification system to categorize news articles, analyzing app or game reviews using topic modeling and text summarization, and clustering popular movie synopses and analyzing the sentiment of movie reviews
- Shows implementations based on Python and several popular open source libraries in NLP and text analytics, such as the natural language toolkit (
nltk
),gensim
,scikit-learn
,spaCy
andpattern