Data Extraction Basics for Docs and Images with OCR and NER

Become a Data Extraction Expert with Python, Pandas, OCR, NER, and Spacy : Learn to Train and Build Real-World Solutions
4.15 (73 reviews)
Udemy
platform
English
language
Data Science
category
Data Extraction Basics for Docs and Images with OCR and NER
385
students
2.5 hours
content
Nov 2024
last update
$44.99
regular price

What you will learn

Learn how to extract data from PDFs, Word docs, scanned images, and more with ease.

Use Tesseract and PyTesseract to perform optical character recognition (OCR) on images with accuracy.

Develop a common pipeline for data extraction from different types of input documents.

Learn how to develop a robust data extraction workflow

Get started on how to use Spacy efficiently for labelling

Learn how to train Spacy for your own data set

Use Pandas to convert extracted data to a CSV format

Design a customizable technical OCR solution for data extraction

Screenshots

Data Extraction Basics for Docs and Images with OCR and NER - Screenshot_01Data Extraction Basics for Docs and Images with OCR and NER - Screenshot_02Data Extraction Basics for Docs and Images with OCR and NER - Screenshot_03Data Extraction Basics for Docs and Images with OCR and NER - Screenshot_04
4479904
udemy ID
1/6/2022
course created date
2/2/2022
course indexed date
Bot
course submited by