Multimodal OCR successor to dots.ocr — "parse anything from documents" — keeping the compact 3B footprint while extending coverage beyond layout/text/table/formula parsing. Includes a dots.mocr-svg variant for vector-graphic output. One of the most-downloaded document-parsing models on HuggingFace (579K+ downloads within months of release). MIT-licensed.

Model Details

Architecture DENSE
Parameters 3B
License MIT
ocrvisionopen-weight

Related