Top 10 Data Science Use cases in Telecom
In the course of time, data science has proved its high value and efficiency. Data scientists find more and more new ways to implement big… Read More »Top 10 Data Science Use cases in Telecom
In the course of time, data science has proved its high value and efficiency. Data scientists find more and more new ways to implement big… Read More »Top 10 Data Science Use cases in Telecom
In a recent article (February 2019) published in Forkes (see here) it was argued that there will be no data science job titles by 2029. The… Read More »Debunking Forbes Article about the Death of the Data Scientist
Summary: The Gartner Magic Quadrant for Data Science and Machine Learning Platforms is just out and once again there are big changes in the leaderboard. … Read More »Advanced Analytic Platforms – Incumbents Fall – Challengers Rise
Is innovation an artisanal feat? Can innovation be weighed, measured on a scale, and optimized like a production operation? Can the use of data analytics… Read More »The Future of Innovation Analytics
The gaming industry is on its rise nowadays. With more than 2 billion players all over the world gaming industry is a resource of enormous… Read More »Top 8 Data Science Use Cases in Gaming
Introduction to topic model: In machine learning and natural language processing, a topic model is a type of statistical model for discovering the abstract “topics” that occur in a collection of documents. Topic modeling is a frequently used text-mining tool for discovery of hidden semantic structures in a text body. In topic modeling, a topic is defined by a cluster of words with each word in the cluster having a probability of occurrence for the given topic, and different topics have their respective clusters of words along with corresponding probabilities. Different topics may share some words and a document can have more than one topic associated with it. Overall, we can say topic modeling is an unsupervised machine learning way to organize text (or image or DNA, etc.) information such that related pieces of text can be identified. Architecture of topics : Figure 1. Organization of topics Figure1. Architecture of topics Figure 1. Shows the organization of topics. Topics comes in two forms, a flat and a hierarchical. In the other word, there are two methods for topic detection (topic model) a flat and a hierarchical. The flat topic models means, these topic models and their extensions can only find topics in a flat structure, but fail to discover the hierarchical relationship among topics. Such a drawback can limit the application of topic models since many applications need and have inherent hierarchical topic structures, such as the categories hierarchy in Web pages , aspects hierarchy in reviews and research topics hierarchy in academia community . In a topic hierarchy, the topics near the root have more general semantics, and the topics close to leaves have more specific semantics. Topic model approach: figure 2 topic modeling approach A popular topic modeling approaches are illustrated in figure 2. These approaches could be divided into two categories, i.e., probabilistic methods and non-probabilistic methods. Probabilistic methods usually model topics as latent factors, and assume that the joint probability of the words and the documents could be described by the mixture of the conditional probabilities over the latent factors i.e., LDA and HDP. On the contrary, non-probabilistic methods usually use NMF and dictionary learning to uncover the low-rank structures using matrix factorization.NMF methods extends matrix factorization based methods to find the topics in the text stream. All the above mentioned methods are static topic detection methods and cannot handle the topic evolving process in the temporal dimension. Thus, various extensions of these methods have been proposed to handle this issue. Reference: Xu, Y., Yin, J., Huang, J., & Yin, Y. (2018). Hierarchical topic modeling with automatic knowledge mining. Expert Systems with Applications, 103, 106-117. Chen, J., Zhu, J., Lu, J., & Liu, S. (2018). Scalable training of hierarchical topic models. Proceedings of the VLDB Endowment, 11(7), 826-839.
Interaction plots are used to understand the behavior of one variable depends on the value of another variable. Interaction effects are analyzed in regression analysis,… Read More »The significance of Interaction Plots in Statistics
Background One of the hardest problem in AI is not technical It is social Specifically, it is the problem of “educating people for living and… Read More »Educating for AI – one of the most critical problems in AI
Key points of this blog include: Digital Transformation sweeps aside traditional industry borders to create new sources of customer and operational value Unfortunately, Digital Transformation… Read More »Using the Digital Transformation Journey Workbook to Deliver “Smart” Spaces
Kafka monitoring is an important and widespread operation which is used for the optimization of the Kafka deployment. This process may be smooth and efficient… Read More »Kafka Monitoring with Prometheus, Telegraf, and Grafana