Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Existing ocr doesn’t skip over entire (legible) paragraphs or hallucinate entire sentences


I usually run the image(s) through more than one converter then compare the results. They all have problems, but the parts they agree on are usually correct.


rarely happens to me using LLMs to transcribe pdfs


This must be some older/smaller model.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: