# Tag Index

- Apache Spark 1
- Python 1
- R 24
- categorical data 1
- classification 1
- clustering 2
- competition 2
- data analysis 10
- data-pipeline 1
- descriptive-statistics 1
- geocoding 1
- preprocessing 12
- python 3

## Apache Spark

## Python

## R

- To eat or not to eat! That's the question? Measuring the association between categorical variables
- Learning a classifier from census data
- Predicting employment related factors in Malaysia- A regression analysis approach
- Predicting rubber plantation yield- A regression analysis approach
- Basic assumptions to be taken care of when building a predictive model
- Data Transformations in R
- Sold! How do home features add up to its price tag?
- Learning from data science competitions- baby steps
- Data Splitting
- Batch Geo-coding in R
- To read multiple files from a directory and save to a data frame
- Gini index to compute inequality or impurity in the data
- Assessing Clustering Tendency in R
- Packages for data mining algorithms in R and Python
- How to create a dissimilarity matrix for mixed type dataset
- Hierarchical Clustering Methods implementation in R- A Case Study
- How to solve the missing data problem?
- How to split a data frame in R with over a million observations in above 50 variables?
- Connect R to SQL Server 2014
- Data Analysis with R Series- Part 1
- Splitting a data frame into training and testing sets in R
- Data preprocessing with R- part II
- Data preprocessing with R
- How to read CSV file into R

## categorical data

## classification

## clustering

- Assessing Clustering Tendency in R
- Hierarchical Clustering Methods implementation in R- A Case Study

## competition

- Sold! How do home features add up to its price tag?
- Learning from data science competitions- baby steps

## data analysis

- To eat or not to eat! That's the question? Measuring the association between categorical variables
- Learning a classifier from census data
- Predicting employment related factors in Malaysia- A regression analysis approach
- Predicting rubber plantation yield- A regression analysis approach
- Basic assumptions to be taken care of when building a predictive model
- Big or small-lets save them all- Visualizing Data
- Big or small-lets save them all- Making Data Management Decisions
- Big or small-lets save them all-Exploratory Data Analysis
- How to create a dissimilarity matrix for mixed type dataset
- Data Analysis with R Series- Part 1

## data-pipeline

## descriptive-statistics

## geocoding

## preprocessing

- Data Transformations in R
- Sold! How do home features add up to its price tag?
- Data Splitting
- Batch Geo-coding in R
- To read multiple files from a directory and save to a data frame
- Assessing Clustering Tendency in R
- How to solve the missing data problem?
- How to split a data frame in R with over a million observations in above 50 variables?
- Splitting a data frame into training and testing sets in R
- Data preprocessing with R- part II
- Data preprocessing with R
- How to read CSV file into R