image
image
| label
class label
2 classes
|
---|---|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
0
(boxes) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
|
1
(images) |
OCR GENERATED Machine-Readable Zone (MRZ) Text Detection
The dataset includes a collection of GENERATED photos containing Machine Readable Zones (MRZ) commonly found on identification documents such as passports, visas, and ID cards. Each photo in the dataset is accompanied by text detection and Optical Character Recognition (OCR) results.
This dataset is useful for developing applications related to document verification, identity authentication, or automated data extraction from identification documents.
The dataset is solely for informational or educational purposes and should not be used for any fraudulent or deceptive activities.
Get the dataset
This is just an example of the data
Leave a request on https://trainingdata.pro/data-market to discuss your requirements, learn about the price and buy the dataset.
Dataset structure
- images - contains of original images of documents
- boxes - includes bounding box labeling for the original images
- annotations.xml - contains coordinates of the bounding boxes and detected text, created for the original photo
Data Format
Each image from images
folder is accompanied by an XML-annotation in the annotations.xml
file indicating the coordinates of the bounding boxes and detected text . For each point, the x and y coordinates are provided.
Example of XML file structure
Text Detection in the Documents might be made in accordance with your requirements.
**TrainingData**
More datasets in TrainingData's Kaggle account: https://www.kaggle.com/trainingdatapro/datasets
TrainingData's GitHub: https://github.com/Trainingdata-datamarket/TrainingData_All_datasets
- Downloads last month
- 2