Text Normalization with Spark – Part 2
Overview This is second in a two part series that talks about Text Normalization using Spark.In this blog post, we are going to understand the… Read More »Text Normalization with Spark – Part 2
Overview This is second in a two part series that talks about Text Normalization using Spark.In this blog post, we are going to understand the… Read More »Text Normalization with Spark – Part 2
Overview Nowadays, there are numerous risks related to bank loans both for the banks and the borrowers getting the loans. The risk analysis about bank… Read More »Loan Prediction – Using PCA and Naive Bayes Classification with R
Overview Call Detail Record (CDR) is the information captured by the telecom companies during Call, SMS, and Internet activity of a customer. This information provides… Read More »Call Detail Record Analysis – K-means Clustering with R
Overview Datameer, an end-to-end big data analytics platform, is built on Apache Hadoop to perform integration, analysis, and visualization of massive volumes of both structured… Read More »Importing and Analyzing Data in Datameer
Overview Dataiku Data Science Studio (DSS), a complete data science software platform, is used to explore, prototype, build, and deliver data products. It significantly reduces… Read More »Sales Data Analysis using DataIku Studio
Overview In the customer management lifecycle, customer churn refers to a decision made by the customer about ending the business relationship. It is also referred… Read More »Customer Churn – Logistic Regression with R
Data matching is the task of identifying, matching, and merging records that correspond to the same entities from several source systems. The entities under consideration… Read More »Data Matching – Entity Identification, Resolution & Linkage
A data-driven organization will use the data as critical evidence to help inform and influence strategy. To be data-driven means cultivating a mindset throughout the… Read More »Data Team for Data Driven Organization
Metabase, an open source, easy-to-use database visualization tool, is built and maintained by a dedicated Metabase team and comes with a Crate driver. It is… Read More »User Analytics using Metabase and MongoDB