When analyzing data, what does a data scientist look for, and what procedure do they follow?
Can anyone tell me what data scientists look for when analyzing data?
When analyzing data, a data scientist should follow this procedure:
Strategy: matching the problem with the solution
Dataset preparation and pre-processing
- Data collection
- Data visualization
- Labeling
- Data selection
- Data preprocessing: data formatting, data cleaning, data anonymization, data sampling
- Data transformation: scaling, decomposition, aggregation
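To make the preprocessing and transformation steps above a bit more concrete, here is a minimal sketch in plain Python. The records, field names, and helper functions are all made up for illustration, and hashing the user name is only a simple stand-in for real anonymization. In practice you would likely use a library such as pandas or scikit-learn for this.

```python
import hashlib
import random
from collections import defaultdict
from statistics import mean

# Hypothetical toy records; the field names are assumptions for this example.
raw_rows = [
    {"user": "alice", "city": "Paris", "income": "52000"},
    {"user": "bob", "city": "Paris", "income": ""},  # missing value
    {"user": "carol", "city": "Berlin", "income": "61000"},
    {"user": "dave", "city": "Berlin", "income": "43000"},
]

def clean(rows):
    """Cleaning and formatting: drop rows with no income, cast income to float."""
    return [{**r, "income": float(r["income"])} for r in rows if r["income"].strip()]

def anonymize(rows):
    """Anonymization stand-in: replace the user name with a truncated SHA-256 digest."""
    return [
        {**r, "user": hashlib.sha256(r["user"].encode()).hexdigest()[:12]}
        for r in rows
    ]

def sample(rows, k, seed=0):
    """Sampling: pick k rows at random, with a fixed seed for reproducibility."""
    return random.Random(seed).sample(rows, k)

def min_max_scale(values):
    """Scaling: map values linearly onto the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

def aggregate_mean(rows, key, field):
    """Aggregation: mean of `field` for each distinct value of `key`."""
    groups = defaultdict(list)
    for r in rows:
        groups[r[key]].append(r[field])
    return {k: mean(v) for k, v in groups.items()}

cleaned = anonymize(clean(raw_rows))
scaled_income = min_max_scale([r["income"] for r in cleaned])
income_by_city = aggregate_mean(cleaned, "city", "income")
subset = sample(cleaned, 2)
```

Here `cleaned` holds the three rows that have an income, `scaled_income` rescales those incomes to between 0 and 1, and `income_by_city` gives one average per city.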