Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Gemini does but it's not as good as Google vision, and the format it's différent Here it's the documentation https://cloud.google.com/vertex-ai/generative-ai/docs/boundi...

Also Simon Willison Made a blog post that might be helpful https://simonwillison.net/2024/Aug/26/gemini-bounding-box-vi...

I hope that this capability improves so I can use only Gemini API.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: