Drennn Posted March 7 Posted March 7 OpenAI's release of GPT-4.5 for ChatGPT Plus subscribers this week had me immediately keen to try it out, though with some skepticism. There had been reports of OpenAI (and other developers) struggling to make the big improvements to their models we've seen before. And the current standard model GPT-4o, is pretty good in most cases. And if you want a comprehensive report, there's the Deep Research feature. OpenAI claims that GPT-4.5 has a high emotional intelligence and nuanced understanding of what you say to it. The company's description painted a picture of both models as reliable friends, but that GPT-4.5 would be the one you'd expect a book of poetry from on your birthday. So, I decided to test 4.5 against 4o with a few prompts that any casual ChatGPT user might deploy. With that analogy in my head, I decided to start with a poetic challenge. I asked both models, "Can you write me a short poem about a rainy afternoon in New York City and make an image for it?" It seemed fair since looking out a rain-streaked window at a busy city can bring out the poet in most people. GPT-4.5 is on the left and GPT-4o is on the right. They are amazingly similar. I personally think GPT-4.5 did a slightly better job with similar ideas. It's evocative of not just the look of rain but the feeling of gray skies, puddles, and traffic among the raindrops. In a blind test of three random friends, two out of three chose the same, with the third saying they just preferred the rhyme scheme of GPT-4o. As for the images, both models used DALL-E 3, but GPT-4.5's looks a lot more realistic. I actually prefer the impressionistic lighting of GPT-4o's attempt, but both get the idea of the poem across pretty well. Both had the right answer of Michelle Yeoh for "Everything Everywhere All at Once," but GPT-4.5 had a really nice explanation why her performance resonated with viewers. It covered her performance and mentioned how Yeoh was the first Asian to win that Oscar. GPT-4o's answer had a lot of the same beats, but it went with an odd essay and numbered list format that was kind of annoying to read when the question was a simple opinion request. GPT-4.5's answer felt more like how an actual human would answer, albeit one who is very into that movie and Yeoh as a performer. Link: https://www.techradar.com/computing/artificial-intelligence/chatgpt-4-5-understands-subtext-but-it-doesnt-feel-like-an-enormous-leap-from-chatgpt-4o
Recommended Posts