to be replaced with blog posting to the questions below:

  1. What is data mining? What are some examples of how data mining can be used?
  2. What are the different steps of the pipeline? Briefly discuss each.
  3. Why is defining the problem first so important?
  4. Why is data cleaning/pre-processing important? What are some aspects of data that need to be cleaned (for example, dealing with null values)?
  5. Find an example of some data understanding/visualizations (e.g., blog post, portfolio). What do you like and/or dislike about it?

<
Previous Post
Cmpd_final_project
>
Next Post
Cmpd Exploration