Want to create interactive content? It’s easy in Genially!

Reuse this genially

IMDB

Kareem Emad Elfrargi (3GENA)

Created on December 2, 2022

Start designing with a free template

Discover more than 1500 professional designs like these:

Audio tutorial

Pechakucha Presentation

Desktop Workspace

Decades Presentation

Psychology Presentation

Medical Dna Presentation

Geometric Project Presentation

Transcript

Speakers

Amira Tarek

Omar Elsherif

Kareem Emad

Amira Ibrhiam

Spekear 4

Spekear 3

Spekear 1

Spekear 2

IMDB

RATING FILMS AND SERIES SO , WE USE DATA SET FROM KOGGLE

IMDB - DATASET

INTRODUCTION

2022

WHAT IS IMDB

IMDb ( an abbreviation of Internet Movie Database) is an online database of information related to films, television series, home videos, video games

and streaming content online – including cast, production crew and personal biographies

LETS GO TO SEE MORE ..

IMDB Analysis 2006 to 2016

Data set was used

From KAGGLE

About , 10 dimensional 1000 Record

Preprocessing

BY REMOVEING Null and Duplicated data

Preprocessing data

- 128 | 64 NULL

Metascore

Rating

Revenue (Millions)

Guardians of the Galaxy

Prometheus

Split

Guardians of the Galaxy

Split

Hacksaw Ridge

Why Him?

The Lost City of Z

Hacksaw Ridge

Why Him?

Search Party

Hostel: Part II

Clean data to use it in our model !

Visulization to our data set

IMDB

Voting for movies

is null data

Related Columns

i need learn so visual data to show hte realted

which movies is good by showing the rate

where are nulls data in your dataset

SVM | SVR MODEL

is one of the most popular Supervised Learning algorithms, which is used for Classification as well as Regression problems .

KNN MODEL

| K-nearest neighbors algorithm : is one of the simplest Machine Learning algorithms based on Supervised Learning technique.

Linear Regression MODEL

is one of the easiest and most popular Machine Learning algorithms. It is a statistical method that is used for predictive analysis .

K-Means MODEL

is an unsupervised learning algorithm that is used to solve the clustering problems in machine learning or data science

Association Rules

finds interesting associations and relationships among large sets of data items

Correlation | Positive - Zero - Negative

NAÏVE BAYES

It is not a single algorithm but a family of algorithms where all of them share a common principle

A decision tree

is a structure that includes a root node, branches, and leaf nodes. Each internal node denotes a test on an attribute

Logistic regression

This type of statistical model (also known as logit model) is often used for classification and predictive analytics

KNN- Algorthims

Best Accuracy

Logistic Regression

Best Accuracy II

Logistic Regression

Best Accuracy II

Thank you!

Project Data Mining