image_paths = glob("MIDV-679/images/*.jpg") ann_paths = {os.path.basename(p).split('.')[0]: p for p in glob("MIDV-679/annotations/*.json")} MIDV-679
Overview MIDV-679 is a widely used dataset for document recognition tasks (ID cards, passports, driver’s licenses, etc.). This tutorial walks you from understanding the dataset through practical experiments: preprocessing, synthetic augmentation, layout analysis, OCR, and evaluation. It’s designed for researchers and engineers who want to build robust document understanding pipelines. Assumptions: you’re comfortable with Python, PyTorch or TensorFlow, and basic computer vision; you have a GPU available for training. import json, cv2, os from glob import glob
This website uses cookies
Our website uses necessary technical cookies. We would also like to set optional cookies (analytics and profiling cookies) and third-party cookies to give you the most relevant experience, for example by remembering your preferences with repeated visits. You can manage your preferences or accept all cookies. You can find more information about the cookies used in our cookie policy and have the option to change your cookie settings afterwards.