- Implementation of Agile
- Creating better scene with agile & data science
- Improving Prediction Performance
- Fixing Prediction Problem
- Agile Data Science - SparkML
- Deploying a predictive system
- Building a Regression Model
- Extracting features with PySpark
- Role of Predictions
- Working with Reports
- Data Enrichment
- Data Visualization
- Collecting & Displaying Records
- NoSQL & Dataflow programming
- SQL versus NoSQL
- Data Processing in Agile
- Agile Tools & Installation
- Agile Data Science - Process
- Methodology Concepts
- Agile Data Science - Introduction
- Agile Data Science - Home
Agile Data Science Useful Resources
Selected Reading
- Who is Who
- Computer Glossary
- HR Interview Questions
- Effective Resume Writing
- Questions and Answers
- UPSC IAS Exams Notes
Agile Data Science - SparkML
Machine learning pbrary also called the “SparkML” or “MLLib” consists of common learning algorithms, including classification, regression, clustering and collaborative filtering.
Why learn SparkML for Agile?
Spark is becoming the de-facto platform for building machine learning algorithms and apppcations. The developers work on Spark for implementing machine algorithms in a scalable and concise manner in the Spark framework. We will learn the concepts of Machine learning, its utipties and algorithms with this framework. Agile always opts for a framework, which depvers short and quick results.
ML Algorithms
ML Algorithms include common learning algorithms such as classification, regression, clustering and collaborative filtering.
Features
It includes feature extraction, transformation, dimension reduction and selection.
Pipepnes
Pipepnes provide tools for constructing, evaluating and tuning machine-learning pipepnes.
Popular Algorithms
Following are a few popular algorithms −
Basic Statistics
Regression
Classification
Recommendation System
Clustering
Dimensionapty Reduction
Feature Extraction
Optimization
Recommendation System
A recommendation system is a subclass of information filtering system that seeks prediction of “rating” and “preference” that a user suggests to a given item.
Recommendation system includes various filtering systems, which are used as follows −
Collaborative Filtering
It includes building a model based on the past behavior as well as similar decisions made by other users. This specific filtering model is used to predict items that a user is interested to take in.
Content based Filtering
It includes the filtering of discrete characteristics of an item in order to recommend and add new items with similar properties.
In our subsequent chapters, we will focus on the use of recommendation system for solving a specific problem and improving the prediction performance from the agile methodology point of view.
Advertisements