Fascinating how hard it still is even for o3 to solve a seemingly simple problem like answering "what time is it" based on this image of a clock with reflections. Btw, in case you were wondering, the "image analysis" (cropping, zooming, etc.) that o3 is doing uses the Python...
35,359 views • 1 year ago • via X (Twitter)
Comments: 11

It's not a simple task. I've never been able to do it without extensive thinking. I hate analog clocks

Have you tried running with o4-mini? There was a post yesterday from an OAI employee saying that o4-mini is better for visual tasks.

I am not sure about that (o4-mini-high thinks it's 9:50)

Curious about jobless claims and their impact on the economy? Dive into my latest free Substack post to explore how Python and FRED can analyze this key indicator. Discover actionable insights that can empower your data-driven decisions.

almost!

It got it right first time for me.

What, is this real?

Yeah. And some people call it "more intelligent than" they are or even AGI 💀

It doesn’t surprise me: if it doesn’t check the date, it stays convinced it’s still June 2024.

It may be counterintuitive that such an advanced model, one that can argue about the most advanced science, has trouble with some simple tasks. It comes from the technology itself, which is weak at what is called spatio-temporal reasoning. SimpleBench was created to test this.

Yeah there seems to be a bit of sloppiness to it
