| Characteristic | Count |
|---|---|
| Number of unique form templates | 50 |
| Number of participants (writers) | 650 |
| Forms filled per participant | 1-2 |
| Total handwritten filled forms | 650 |
| Scanned high-resolution forms | 650 |
| Captured mobile images (7-10 per form) | 5,350 |
| Total dataset size (images) | 6,000 |
| Method | WRR | CRR | P | R | F1 |
|---|---|---|---|---|---|
| Google Form Parser | 92.14 | 96.38 | 88.90 | 90.45 | 89.67 |
| Azure Form Recognizer | 93.29 | 97.25 | 91.12 | 92.60 | 91.85 |
| PaddleOCR | 35.23 | 64.03 | 52.21 | 48.70 | 50.40 |
| DocTR | 32.22 | 65.44 | 50.93 | 46.28 | 48.50 |
| Donut | 65.33 | 70.12 | 66.45 | 67.21 | 66.83 |
| Naugat | 76.13 | 83.39 | 77.90 | 79.15 | 78.52 |
| FormLens (ours) | 95.44 | 98.33 | 94.12 | 95.31 | 94.71 |
Try our FormLens model live! Upload your handwritten forms and see the results instantly.
🚀 Try Live DemoIf you use FormLens in your research, please cite our paper:
@inproceedings{bhattacharyya2025formlens,
title = {FormLens: From Ink to Insight with Adapting Vision-Language Models for Handwritten Form Digitization},
author = {Shaon Bhattacharyya and Ajoy Mondal and C. V. Jawahar},
booktitle = {Proceedings of the 15th Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP)},
year = {2025},
address = {IIT Mandi, India},
pages = {1--8},
organization = {ACM / Springer},
institution = {Centre for Visual Information Technology (CVIT), IIIT Hyderabad},
note = {Accepted paper, 8 pages, double-column format}
}
© 2025 FormLens Project. All rights reserved.