Sure, Gemini can directly output formats like HTML and JSON, but the tradeoff is that if you require more complex functionality, there’s a high chance it could break something, leading to more errors, etc.
I think for OCR, text accuracy is the most important consideration. Also, since this dictionary has a relatively small vocabulary, it’s easy to fix formatting issues on your own.