Data Extraction Basics for Docs and Images with OCR and NER

Become a Data Extraction Expert with Python, Pandas, OCR, NER, and Spacy : Learn to Train and Build Real-World Solutions

4.15 (73 reviews)

Udemy

platform

English

language

Data Science

category

Vineeta Vashistha

instructor

Data Extraction Basics for Docs and Images with OCR and NER

385

students

2.5 hours

content

Nov 2024

last update

$44.99

regular price

What you will learn

Learn how to extract data from PDFs, Word docs, scanned images, and more with ease.

Use Tesseract and PyTesseract to perform optical character recognition (OCR) on images with accuracy.

Develop a common pipeline for data extraction from different types of input documents.

Learn how to develop a robust data extraction workflow

Get started on how to use Spacy efficiently for labelling

Learn how to train Spacy for your own data set

Use Pandas to convert extracted data to a CSV format

Design a customizable technical OCR solution for data extraction

Screenshots

Data Extraction Basics for Docs and Images with OCR and NER - Screenshot_01

Data Extraction Basics for Docs and Images with OCR and NER - Screenshot_02

Data Extraction Basics for Docs and Images with OCR and NER - Screenshot_03

Data Extraction Basics for Docs and Images with OCR and NER - Screenshot_04

Related Topics

Computer Vision

Natural Language Processing

4479904

udemy ID

1/6/2022

course created date

2/2/2022

course indexed date

Bot

course submited by