# CRISP-DM
Cross-Industry Standard Process for Data Mining
Is a standardized approach to data mining projects. CRISP-DM breaks down Data Mining in 6 easy steps.
# Steps
- Business Understanding
- understand business requirements & goals
- clarify how dm-process can bring value
- Data Understanding
- understand quality & relevance of data
- Data Preparation
- prepare data for analysis (choose & merge)
- data cleaning, transformation, integration, feature engenieering
- Modeling
- select data to build models
- identify patterns & trends in data
- Evaluation
- verify models meet business objectives
- Deployment
- integrate models into business processes
# Roles & Definitions
- Data Mining
- discover patterns, correlations from large datasets
- Data Science
- scientific methods, algorithms, systems to extract knowledge & insights from data
- statistics, ML, DM, visualization to analyze complex problems
- work in all 6 fields of CRISP
- Explorative Dataanalysis (EDA)
- analyze datasets to summarize their main characteristics
- understanding data’s structure, patterns, relationships
- Data Engineering
- Data Analyst
- Data Engineer
- Data Scientist
- DevOps/MLOps/DataOps