🍓 Classification of fruit images

OPENCLASSROOMS - Data Scientist - Project 8

This repository contains the files for a Big Data project that featurizes images with a MobileNetV2 model on Google Cloud Platform.
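For readers new to the idea, featurizing an image means running it through a pretrained network and keeping an intermediate vector instead of the class prediction. The sketch below is one possible way to do this with Keras, not necessarily what the notebooks do: the function names `build_featurizer` and `featurize`, the 224x224 resize, and the use of global average pooling are assumptions made for illustration.

```python
IMAGE_SIZE = (224, 224)  # assumed input size; MobileNetV2's usual ImageNet resolution


def build_featurizer(weights="imagenet"):
    """Return MobileNetV2 without its classification head (hypothetical helper)."""
    # Imported inside the function so the module still loads on machines
    # where TensorFlow is not installed.
    import tensorflow as tf

    return tf.keras.applications.MobileNetV2(
        input_shape=IMAGE_SIZE + (3,),
        include_top=False,  # drop the 1000-class ImageNet classifier
        pooling="avg",      # average the last feature map into one vector per image
        weights=weights,
    )


def featurize(model, images):
    """Map a uint8 batch of shape (n, h, w, 3) to an (n, 1280) float array."""
    import tensorflow as tf

    # Resize the images to the size the network expects.
    x = tf.image.resize(tf.cast(images, tf.float32), IMAGE_SIZE)
    # Scale pixel values from [0, 255] to [-1, 1], as MobileNetV2 expects.
    x = tf.keras.applications.mobilenet_v2.preprocess_input(x)
    return model.predict(x, verbose=0)
```

Calling `featurize(build_featurizer(), batch)` on a batch of RGB images should give one 1280-dimensional vector per image. Those vectors can then feed a dimensionality reduction step or a lightweight classifier.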

📊 Data

The dataset used for this project is Fruits 360, which contains tens of thousands of fruit images (100x100 pixels each).
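Fruits 360 stores its images in one folder per fruit class, so a label can usually be read from the file path. The snippet below is a minimal sketch of that idea. The helper name `label_from_path` and the bucket name in the example paths are hypothetical, and the layout `<split>/<class name>/<file>.jpg` is assumed.

```python
from collections import Counter


def label_from_path(path):
    """Return the name of the folder that directly contains the image file."""
    # Assumes a layout such as ".../Training/Apple Braeburn/0_100.jpg".
    return path.rstrip("/").split("/")[-2]


# Hypothetical paths, for illustration only.
paths = [
    "gs://hypothetical-bucket/fruits-360/Training/Apple Braeburn/0_100.jpg",
    "gs://hypothetical-bucket/fruits-360/Training/Apple Braeburn/1_100.jpg",
    "gs://hypothetical-bucket/fruits-360/Training/Banana/0_100.jpg",
]

# Count how many images each class has.
label_counts = Counter(label_from_path(p) for p in paths)
```

The same split-on-`/` logic carries over to a PySpark column expression when the paths are held in a DataFrame.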

📁 Files

  • notebook_local.ipynb: Image processing with PySpark (local instance)
  • notebook_cloud.ipynb: Image processing with PySpark (notebook uploaded to GCP)
  • pyspark_script.py: Image processing with PySpark (script run on GCP)
  • projet8_presentation.pdf: Final presentation of the project

🛠️ Tools

  • Python 3.x
  • Jupyter Notebook
  • NumPy
  • Pandas
  • TensorFlow
  • PySpark
  • GCP: Cloud Storage, Dataproc