# Tag Index

- Apache Spark 1
- Python 1
- Python-3 1
- R 28
- categorical data 1
- classification 8
- clustering 2
- competition 2
- data analysis 12
- data-pipeline 1
- descriptive-statistics 1
- geocoding 1
- imbalanced data 3
- interview 2
- prediction 1
- preprocessing 12
- python 3
- visualization 1
- web scraping 1

## Apache Spark

## Python

## Python-3

## R

- Predicting the misclassification cost incurred in air pressure system failure in heavy vehicles
- A classification approach to predicting air crash survival
- Employee flight risk modeling behavior
- Scraping twitter data to visualize trending tweets in Kuala Lumpur
- To eat or not to eat! That's the question? Measuring the association between categorical variables
- Learning a classifier from census data
- Predicting employment related factors in Malaysia- A regression analysis approach
- Predicting rubber plantation yield- A regression analysis approach
- Basic assumptions to be taken care of when building a predictive model
- Data Transformations in R
- Sold! How do home features add up to its price tag?
- Learning from data science competitions- baby steps
- Data Splitting
- Batch Geo-coding in R
- To read multiple files from a directory and save to a data frame
- Gini index to compute inequality or impurity in the data
- Assessing Clustering Tendency in R
- Packages for data mining algorithms in R and Python
- How to create a dissimilarity matrix for mixed type dataset
- Hierarchical Clustering Methods implementation in R- A Case Study
- How to solve the missing data problem?
- How to split a data frame in R with over a million observations in above 50 variables?
- Connect R to SQL Server 2014
- Data Analysis with R Series- Part 1
- Splitting a data frame into training and testing sets in R
- Data preprocessing with R- part II
- Data preprocessing with R
- How to read CSV file into R

## categorical data

## classification

- Risky loan applicants data analysis case study
- Predicting the misclassification cost incurred in air pressure system failure in heavy vehicles
- A classification approach to predicting air crash survival
- Employee flight risk modeling behavior
- Learning a classifier from census data
- Predicting rubber plantation yield- A regression analysis approach
- Basic assumptions to be taken care of when building a predictive model
- Learning from data science competitions- baby steps

## clustering

- Assessing Clustering Tendency in R
- Hierarchical Clustering Methods implementation in R- A Case Study

## competition

- Sold! How do home features add up to its price tag?
- Learning from data science competitions- baby steps

## data analysis

- Employee flight risk modeling behavior
- Scraping twitter data to visualize trending tweets in Kuala Lumpur
- To eat or not to eat! That's the question? Measuring the association between categorical variables
- Learning a classifier from census data
- Predicting employment related factors in Malaysia- A regression analysis approach
- Predicting rubber plantation yield- A regression analysis approach
- Basic assumptions to be taken care of when building a predictive model
- Big or small-lets save them all- Visualizing Data
- Big or small-lets save them all- Making Data Management Decisions
- Big or small-lets save them all-Exploratory Data Analysis
- How to create a dissimilarity matrix for mixed type dataset
- Data Analysis with R Series- Part 1

## data-pipeline

## descriptive-statistics

## geocoding

## imbalanced data

- Risky loan applicants data analysis case study
- Predicting the misclassification cost incurred in air pressure system failure in heavy vehicles
- A classification approach to predicting air crash survival

## interview

- Predicting the misclassification cost incurred in air pressure system failure in heavy vehicles
- A classification approach to predicting air crash survival

## prediction

## preprocessing

- Data Transformations in R
- Sold! How do home features add up to its price tag?
- Data Splitting
- Batch Geo-coding in R
- To read multiple files from a directory and save to a data frame
- Assessing Clustering Tendency in R
- How to solve the missing data problem?
- How to split a data frame in R with over a million observations in above 50 variables?
- Splitting a data frame into training and testing sets in R
- Data preprocessing with R- part II
- Data preprocessing with R
- How to read CSV file into R

## python

- Big or small-lets save them all- Visualizing Data
- Big or small-lets save them all- Making Data Management Decisions
- Big or small-lets save them all-Exploratory Data Analysis