Use popular data mining techniques in Microsoft Excel to better understand machine learning methods.
Software tools and programming language packages take data input and deliver data mining results directly, presenting no insight on working mechanics and creating a chasm between input and output. This is where Excel can help.
Excel allows you to work with data in a transparent manner. When you open an Excel file, data is visible immediately and you can work with it directly. Intermediate results can be examined while you are conducting your mining task, offering a deeper understanding of how data is manipulated and results are obtained. These are critical aspects of the model construction process that are hidden in software tools and programming language packages.
This book teaches you data mining through Excel. You will learn how Excel has an advantage in data mining when the data sets are not too large. It can give you a visual representation of data mining, building confidence in your results. You will go through every step manually, which offers not only an active learning experience, but teaches you how the mining process works and how to find the internal hidden patterns inside the data.
What You Will Learn
- Comprehend data mining using a visual step-by-step approach
- Build on a theoretical introduction of a data mining method, followed by an Excel implementation
- Unveil the mystery behind machine learning algorithms, making a complex topic accessible to everyone
- Become skilled in creative uses of Excel formulas and functions
- Obtain hands-on experience with data mining and Excel
Who This Book Is For
Anyone who is interested in learning data mining or machine learning, especially data science visual learners and people skilled in Excel, who would like to explore data science topics and/or expand their Excel skills. A basic or beginner level understanding of Excel is recommended.
Table of contents
1. Excel and Data Mining 2. Linear Regression 3. K-Means Clustering 4. Linear Discriminant Analysis 5. Cross-Validation and ROC 6. Logistic Regression 7. K-Nearest Neighbors 8. Naïve Bayes Classification 9. Decision Trees 10. Association Analysis 11. Artificial Neural Network 12. Text Mining 13. After Excel