OpenAI recently released o1 and o1 pro in their 12 Days of OpenAI – Live updates, offering unlimited access through a $200 ChatGPT Pro subscription. With much speculation surrounding their capabilities, I wondered – Is this premium subscription worth the investment? To answer this, I pitted these two AI models against each other in six challenging tasks. This article explores their strengths, weaknesses, and overall performance. By the end, you’ll have a clear understanding of whether the $200 ChatGPT Pro subscription is the right choice for you or not!
As the first test of o1 vs o1 pro, I am taking a zebra problem – hard level from this website. Let’s see which one cracks it better!
Prompt:
Solve this zebra problem:
o1 Resonse:
Putting this response as the solution, it turns out to be incorrect:
o1 pro Response:
Putting this response as the solution, it turns out to be correct:
Observation:
o1 pro took much more time than o1 to respond. o1 fails to solve the problem, whereas o1 pro succeds!
Verdict:
o1 ❌ | o1 pro ✅
Prompt:
Find 3 differences in the two images:
o1 Response:
Observation:
Only the second difference is correct which is the hair accessory missing, rest 2 are incorrect.
o1 pro Response:
Observation:
Only the first difference is correct. Rest 2 are incorrect.
Both o1 and o1 pro were not able to respond correctlty. However, o1 was faster in generating the response.
Verdict:
o1 ❌ | o1 pro ❌
In this challenge, I will be giving a computing Indefinite Integrals problem to the models. Let’s see which one is able to solve it!
Prompt:
Solve this math problem:
o1 and o1 pro Response (Same):
Observation:
Both the models provided the correct answer but the o1 was much faster than o1 pro in finding the solution.
Verdict:
o1 ✅ | o1 pro ✅
Prompt:
Read the article – https://www.analyticsvidhya.com/blog/2024/07/building-agentic-rag-systems-with-langgraph/ to understand the process of building a vector database for Wikipedia data. Summarize the key steps in a concise manner.
o1 Response:
o1 pro Response:
Observation:
The “o1 pro response” is closer to the actual implementation in the article. Here’s why:
The article provides a much more detailed, step-by-step implementation involving:
The o1 pro response captures more nuance by mentioning:
By contrast, the initial “o1 response” is more generic and lacks the technical depth demonstrated in the article. So the o1 pro response is significantly closer to the article’s actual implementation.
Verdict:
o1 ❌ | o1 pro ✅
Prompt:
Create an image of a cat.
o1 Response:
o 1 pro Response:
Observation:
Both o1 and o1 pro were not able to generated images indicating both the o1 versions do not support image generation. However, on giving the same prompt to GPT 4o, I got the response:
Hence, it is safe to say that only GPT 4o is beating both o1 and o1 pro in image generation!
Verdict:
o1 ❌ | o1 pro ❌
Prompt:
Create a comprehensive flow chart illustrating the Reflection Pattern in Agentic AI.
o1 Response:
o1 pro Response:
Both provided incomplete flow chats, so I decided to update my prompt. Here’s my updated prompt:
New Prompt:
These are the steps involved in reflection patter –
o1 Response:
o1 pro Response:
Observation:
Even though the content in both the responses is the same, o1 is definetly winning by providing an actual flow chart, whereas o1 pro only provided the correct content.
Verdict:
o1 ✅ | o1 pro ❌
Challenge | Verdict |
---|---|
Zebra Problem | o1 pro succeeded, but was slower |
Find Differences | Both models performed poorly |
Math Problem | Both solved correctly, o1 was faster |
Analyzing Article | o1 pro provided more depth |
Image Creation | Neither could generate images (GPT 4o could) |
Creating a Logical Flow Chart | o1 won by creating an actual flow chart |
o1 pro seems to have a slight edge in terms of problem-solving depth and accuracy, particularly in complex tasks like solving the zebra problem and analyzing technical articles. However, o1 tends to be faster and performs well in simpler tasks.The verdict appears to be that o1 pro is marginally better, especially for more complex or technical challenges that require deeper understanding.
Also Read: Is the New o1 Model Better than GPT-4o?
While o1 pro shows promise in complex problem-solving, it’s important to consider your specific needs and budget. For basic to intermediate tasks, GPT-4o or other more affordable alternatives might suffice. If complex problem-solving is a priority and you’re willing to invest, o1 pro could be a valuable tool.
However, given that OpenAI is continually refining these models, it might be wise to wait for further updates before making a definitive decision. OpenAI is likely to add more benefits to the $200 ChatGPT Pro plan in the future.
What are your thoughts on this? Let me know in the comment section below.
Stay tuned to Analytics Vidhya Blog for more such awesome updates!