JK thinks – Page 2 – Thoughts on New Business

What Matters in E-commerce

Post author By JK
Post date May 18, 2015

Next growth for e-commerce: According to a recent news article, the EU Commission is investigating antitrust issues in e-commerce. Everyone knows that, by and large, online shopping is replacing legacy shopping channels, and that some global players dominate this sector. With this perspective in mind, what is important is […]

Data Science

Maximum-likelihood Estimation in R

Post author By JK
Post date April 17, 2015

If you are a marketer for a sports team and your mission is to boost sales of annual memberships for home games, what do you do first? You may want to know which factors to focus on to encourage customers to renew their annual memberships. Actually, you need to wonder what is […]

Data Science

Principal Component Analysis & Factor Analysis in R

Post author By JK
Post date March 31, 2015

Let’s say there is a set of survey data consisting of more than fifty questions, and the total number of respondents reaches 60,000. It will probably take you a lot of time to analyze the data according to your original intention or analysis objectives. Most people try to classify data or divide them into pieces […]

Data Science

RFM: Simple & Efficient Way to Focus on Highly-responded Customers

Post author By JK
Post date February 16, 2015

When you want to focus on target segments with a high response rate, RFM is one of the most useful methods. Above all, RFM is intuitive and yields results easily because it is a heuristic form of analytics, unlike a regression model. RFM is an acronym […]

Data Science

How to Interpret Texts

Once you have data, whether obtained from social media or other sources, the next step is to analyze what those data mean. However, natural language is difficult to interpret for sentiment analysis. Fortunately, there are several ways to understand text data and even quantify it. TextBlob […]

Data Science

How to Collect Tweets for Analysis

Post author By JK
Post date January 21, 2015

To analyze how a service or product is received in a market, many people have relied on traditional methods such as market surveys and FGI (focus group interviews). However, these methods are costly and are constrained by the space and time needed to design the research, from laying out a questionnaire to recruiting survey respondents. There is a simpler […]

Data Science

[Clustering Analysis in R] #4. Data Analysis

Post author By JK
Post date December 31, 2014

Finally, we can move on to the clustering analysis itself, which separates customers by their characteristics and finds the representative tendency of each group (segment). To that end, I will use two approaches: k-means and hierarchical clustering analysis. The former is to find whether independent groups have high similarity to their representative observation […]

Data Science

[Clustering Analysis in R] #3. Data Diagnostics

Post author By JK
Post date December 30, 2014

Now we need to diagnose whether these data are adequate for analysis, so that the results do not stem from a biased sample distribution or correlated variables. To that end, a multicollinearity test clarifies the correlation between independent variables; I used corrgram(), from the R package of the same name, for this. > install.packages("corrgram") […]

Data Science

[Clustering Analysis in R] #2. Data Processing

Post author By JK
Post date December 29, 2014

Broadly, there are two types of data: quantitative and qualitative. For analyses such as regression or classification, you need to transform qualitative data into quantitative data. Most categorical data can be transformed into dummy variables. For instance, male can be zero and female […]

Data Science

[Clustering Analysis in R] #1. Introduction & Data Gathering

Post author By JK
Post date December 28, 2014

Before Starting the Process: I started writing these posts because, as a beginner and learner of data science, I wanted to share my knowledge of clustering analysis and develop it through active discussion. The main objective of these posts is to understand the whole process from data gathering to […]