Skip to content

General solution for alphanumeric text-based captchas using Convolutional Neural Networks

Notifications You must be signed in to change notification settings

joel-huang/captcha-breaker

Repository files navigation

General solution for alphanumeric text-based captchas using Convolutional Neural Networks

50.038 Computational Data Science project

The main objective of this project is to equip and familiarize students with the necessary skills to successfully complete a data science project, including data collection and processing, data exploration and visualization, identifying and formulating problems, developing algorithms and models, designing experimental evaluations and discussing results, scientific writing and working in teams.

Repository structure

  • /data data folder, included in .gitignore
  • /generate code used to generate the Python captcha dataset
  • /captcha a local version of the captcha library, modified so bounding boxes are extractable
  • /tex source files for the report (compile using pdfLaTeX and BibTeX)

Project outcomes

  1. Dataset, Collection, Pre-processing
  • What are the characteristics of the dataset being used?
  • How is the dataset being collected, combined, augmented, etc?
  • What are the various steps in pre-processing the dataset?
  1. Problem Definition
  • Is the problem clearly defined?
  • Why is the selected problem important, hard, etc?
  1. Proposed Algorithm/Approach
  • Is the proposed algorithm/approach defined in sufficient details?
  • Why is this an appropriate algorithm/approach for the problem?
  1. Evaluation Methodology
  • How do you go about evaluating your proposed algorithm/approach?
  • What are the evaluation metrics and why are they appropriate?
  1. Results and Discussion
  • What are your main findings from this study?
  • Is there a fair discussion of possible issues and limitations of this study?
  1. Overall Presentation
  • How well structured and organized is the report?
  • Are the appropriate visualizations (graphs, charts, tables, etc) being used?
  1. Originality/Creativity
  • How original and/or creative is the proposed study in terms of the above-mentioned points?

About

General solution for alphanumeric text-based captchas using Convolutional Neural Networks

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published