Vietnam.vn - Platform for promoting Vietnam

CMC reaches world top 12 in text recognition

The CATI-VLM (Visual Document Understanding) model developed by CMC Technology Application Institute (CMC ATI) has surpassed many international competitors to reach the top 12 in the world and first place in Vietnam in the Document Visual Question Answering (DocVQA) category of the Robust Reading Competition (RRC) rankings announced in June 2025.

Báo Nhân dân, 02/07/2025

RRC ranking in DocVQA category 6/2025.

As Vietnam pursues digital transformation and the adoption of artificial intelligence, optical character recognition (OCR) technology plays an increasingly important role in digitizing documents, automating business processes, saving costs and improving management efficiency. However, because Vietnamese uses diacritics and many documents are handwritten, the recognition problem does not stop at 'reading words': the model must also understand the context as a whole.

CMC Technology Application Institute (CMC ATI) recently announced the CATI-VLM (Visual Document Understanding) model, which its research team developed from a 5 TB data repository. The model surpassed many international competitors to reach the top 12 in the world and first place in Vietnam in the Document Visual Question Answering (DocVQA) category of the Robust Reading Competition (RRC) rankings announced in June 2025.

The Robust Reading Competition (RRC) (https://rrc.cvc.uab.es/) is a prestigious scientific competition organized by the Computer Vision Center (CVC) of the Universitat Autònoma de Barcelona (UAB), Spain, one of the world's leading research centers in computer vision.

The competition was launched in 2011 and is held annually within the framework of the International Conference on Document Analysis and Recognition (ICDAR), one of the world’s leading forums in the field of computer vision. It attracts a large number of researchers and engineers from universities, research institutes and large technology corporations such as Tsinghua University, Hyundai Motor Group and Tencent. RRC’s tasks are designed to promote technological progress and are closely tied to practical problems, ranging from translation and enterprise data management to urban analysis and historical document processing.

Dr. Dang Minh Tuan, Director of CMC ATI, said: "We are very pleased that the research capacity of the CMC team has been affirmed in a prestigious global arena like RRC. In just a short time, the research team has achieved a high ranking, demonstrating that it can compete internationally with big names from developed countries. More importantly, this clearly demonstrates our ability to master technology to solve problems specific to the Vietnamese language and to specialized fields in Vietnam."

Dr. Dang Minh Tuan, Director of CMC ATI.

CATI-VLM differs from traditional OCR in that it not only extracts characters but also understands multiple layers of information: text content, non-text elements (tick marks, checkboxes, charts, signatures, formulas), layout (page structure, tables, forms) and style (fonts, highlights, etc.). The model can answer questions posed about document images, in a conversational manner similar to ChatGPT, without having to learn specific form templates in advance.

Notably, in the RRC rankings, CATI-VLM, with only 3 billion parameters, achieved the highest accuracy on 4 of 7 datasets, surpassing many Big Tech models such as DeepSeek (27 billion parameters), GPT-4 Vision Turbo + Amazon Textract OCR (top 34) and Baidu (top 22).

The achievement also reflects a practical approach: mastering core technology and optimizing the model to suit Vietnam's infrastructure conditions rather than chasing parameter scale.

Sample College Admissions Application Form
Text has been recognized from handwriting in the image above.

Mr. Nguyen Trung Chinh, Chairman of the Board of Directors and Executive Chairman of CMC Technology Group, emphasized: "This is the result of more than a decade of persistent investment in technology research and development (R&D). CMC's strong results in the international technology arena affirm our strategy of mastering Vietnamese technology, together with our orientation toward AI Transformation and entering the global market. We believe that Vietnamese intelligence is fully capable of standing shoulder to shoulder with global Big Tech and earning a worthy position on the world technology map."

CATI-VLM will be applied across the product chain of the C.OpenAI ecosystem, including the CLS virtual assistant for reviewing legal documents, the CMC SmartDoc digital document conversion platform, the CMC KMS knowledge management system, an automatic reporting system for smart offices, and new-generation Agentic Documents applications.

Source: https://nhandan.vn/cmc-dat-top-12-the-gioi-ve-nhan-dang-van-ban-post891252.html

