
About 3 months ago I tried OCR with all of the open-source models available at the time, including Llama 4. The inputs were PNGs of text in a regular font. Most models produced garbage; Llama 4 was the exception, and even it was only about 90% accurate. OpenAI and Gemini produced much better results, but the open-source models were really bad.

