Want to create interactive content? It’s easy in Genially!
IMDB
Kareem Emad Elfrargi (3GENA)
Created on December 2, 2022
Start designing with a free template
Discover more than 1500 professional designs like these:
View
Audio tutorial
View
Pechakucha Presentation
View
Desktop Workspace
View
Decades Presentation
View
Psychology Presentation
View
Medical Dna Presentation
View
Geometric Project Presentation
Transcript
Speakers
Amira Tarek
Omar Elsherif
Kareem Emad
Amira Ibrhiam
Spekear 4
Spekear 3
Spekear 1
Spekear 2
IMDB
RATING FILMS AND SERIES SO , WE USE DATA SET FROM KOGGLE
IMDB - DATASET
INTRODUCTION
2022
WHAT IS IMDB
IMDb ( an abbreviation of Internet Movie Database) is an online database of information related to films, television series, home videos, video games
and streaming content online – including cast, production crew and personal biographies
LETS GO TO SEE MORE ..
IMDB Analysis 2006 to 2016
Data set was used
From KAGGLE
About , 10 dimensional 1000 Record
Preprocessing
BY REMOVEING Null and Duplicated data
Preprocessing data
- 128 | 64 NULL
Metascore
Rating
Revenue (Millions)
Guardians of the Galaxy
Prometheus
Split
Guardians of the Galaxy
Split
Hacksaw Ridge
Why Him?
The Lost City of Z
Hacksaw Ridge
Why Him?
Search Party
Hostel: Part II
Clean data to use it in our model !
Visulization to our data set
IMDB
Voting for movies
is null data
Related Columns
i need learn so visual data to show hte realted
which movies is good by showing the rate
where are nulls data in your dataset
SVM | SVR MODEL
is one of the most popular Supervised Learning algorithms, which is used for Classification as well as Regression problems .
KNN MODEL
| K-nearest neighbors algorithm : is one of the simplest Machine Learning algorithms based on Supervised Learning technique.
Linear Regression MODEL
is one of the easiest and most popular Machine Learning algorithms. It is a statistical method that is used for predictive analysis .
K-Means MODEL
is an unsupervised learning algorithm that is used to solve the clustering problems in machine learning or data science
Association Rules
finds interesting associations and relationships among large sets of data items
Correlation | Positive - Zero - Negative
NAÏVE BAYES
It is not a single algorithm but a family of algorithms where all of them share a common principle
A decision tree
is a structure that includes a root node, branches, and leaf nodes. Each internal node denotes a test on an attribute
Logistic regression
This type of statistical model (also known as logit model) is often used for classification and predictive analytics
KNN- Algorthims
Best Accuracy
Logistic Regression
Best Accuracy II
Logistic Regression
Best Accuracy II
Thank you!
Project Data Mining