Datalab
@datalab-to
Organization
Developing state-of-the-art document intelligence models.
Public repos
10
Public gists
0
Member since
Jul 25, 2024
United States of America
On the leaderboard
| Rank | Repository | Stars |
|---|---|---|
| 838 | datalab-to/marker | 33,344 |
Top repositories by stars
- Python 31,728
- datalab-to/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
Python 19,276
- datalab-to/chandra
OCR model that handles complex tables, forms, handwriting with full layout.
Python 4,825
- datalab-to/pdftext
Extract structured text from pdfs quickly
Python 661
- datalab-to/docext
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
Python 9
- Python 8
- datalab-to/datalab-on-prem
Scripts to run Datalab's self-service on-prem container
Shell 4
- Python 3
- Python 2
- Python 1