Close Menu

    Subscribe to Updates

    Get the latest news information from worldwide businesses.

    What's Hot

    Thinking Machines wants to build an AI that actually listens while it talks

    May 12, 2026

    JUPITER supercomputer breaks world record with 50-qubit quantum simulation

    May 12, 2026

    Business News Today: Stock and Share Market News, Economy and Finance News, Sensex, Nifty, Global Market, NSE, BSE Live IPO News

    May 12, 2026
    Facebook Instagram YouTube LinkedIn X (Twitter)
    Trending
    • Thinking Machines wants to build an AI that actually listens while it talks
    • JUPITER supercomputer breaks world record with 50-qubit quantum simulation
    • Business News Today: Stock and Share Market News, Economy and Finance News, Sensex, Nifty, Global Market, NSE, BSE Live IPO News
    • Teen bedroom art installation shines spotlight on Ukraine’s stolen children | Ukraine
    • Derogatory Remarks Against Pm Modi: FIR filed against SP MP Ajendra Singh Lodhi over derogatory remarks against PM Modi | India News
    • Trump rushed to safety after shooting at WHCA Dinner
    • Black holes slamming into scorching stars may be causing mysterious blue flashes in the cosmos
    • Avalanche bounce back to beat Wild, go up 3-1
    Newspublicly
    • About Us
    • Advertise & Partner with us
    • Pitch Your Story
    • Contact Us
    Facebook Instagram LinkedIn X (Twitter)
    Subscribe
    • Home
    • World News
      • Asia
      • India
      • USA
      • UK & Europe
      • Middle East
    • Economy & Business
      • Global Economy
      • Corporate & Industry
      • Finance & Markets
      • Policy & Trade
    • Technology
      • Gadgets & Devices
      • Software & Apps
      • AI & Machine Learning
      • Robotics & Automation
    • Health & Medicine
      • Fitness & Nutrition
      • Research & Innovation
      • Disease & Treatment
      • Doctors, Clinics & Patient Care
    • Travel & Tourism
    • Automobile
      • Electric & Hybrid Vehicles
      • Auto Industry Insights
    • Sports
    • More
      • Education
      • Real Estate
      • Environment & Climate
      • Space & Astronomy
      • War & Conflicts
    Newspublicly
    Home»Technology»Robotics & Automation»Study finds ChatGPT gets science wrong more often than you think
    Robotics & Automation

    Study finds ChatGPT gets science wrong more often than you think

    Divya SharmaBy Divya SharmaMay 12, 2026No Comments4 Mins Read0 Views
    Share
    Facebook Twitter LinkedIn Copy Link WhatsApp


    Washington State University professor Mesut Cicek and his research team repeatedly tested ChatGPT by giving it hypotheses taken from scientific papers. The goal was to see if the AI could correctly determine whether each claim was supported by research or not — in other words, whether it was true or false.

    In total, the team evaluated more than 700 hypotheses and asked the same question 10 times for each one to measure consistency.

    Accuracy Results and Limits of AI Performance

    When the experiment was first conducted in 2024, ChatGPT answered correctly 76.5% of the time. In a follow-up test in 2025, accuracy rose slightly to 80%. However, once the researchers adjusted for random guessing, the results looked far less impressive. The AI performed only about 60% better than chance, a level closer to a low D than to strong reliability.

    The system had the most difficulty identifying false statements, correctly labeling them only 16.4% of the time. It also showed notable inconsistency. Even when given the exact same prompt 10 times, ChatGPT produced consistent answers only about 73% of the time.

    Inconsistent Answers Raise Concerns

    “We’re not just talking about accuracy, we’re talking about inconsistency, because if you ask the same question again and again, you come up with different answers,” said Cicek, an associate professor in the Department of Marketing and International Business in WSU’s Carson College of Business and lead author of the new publication.

    “We used 10 prompts with the same exact question. Everything was identical. It would answer true. Next, it says it’s false. It’s true, it’s false, false, true. There were several cases where there were five true, five false.”

    AI Fluency vs. Real Understanding

    The findings, published in the Rutgers Business Review, highlight the importance of using caution when relying on AI for important decisions, especially those that require nuanced or complex reasoning. While generative AI can produce smooth, convincing language, it does not yet demonstrate the same level of conceptual understanding.

    According to Cicek, these results suggest that artificial general intelligence capable of truly “thinking” may still be further away than many expect.

    “Current AI tools don’t understand the world the way we do — they don’t have a ‘brain,'” Cicek said. “They just memorize, and they can give you some insight, but they don’t understand what they’re talking about.”

    Study Design and Methods

    Cicek worked with co-authors Sevincgul Ulu of Southern Illinois University, Can Uslay of Rutgers University, and Kate Karniouchina of Northeastern University.

    The team used 719 hypotheses from scientific studies published in business journals since 2021. These types of questions often involve nuance, with multiple factors influencing whether a hypothesis is supported. Reducing such complexity to a simple true or false judgment requires careful reasoning.

    The researchers tested the free version of ChatGPT-3.5 in 2024 and the updated ChatGPT-5 mini in 2025. Overall, performance remained similar across both versions. After adjusting for random chance, which gives a 50% probability of a correct answer, the AI’s effectiveness was only about 60% above chance in both years.

    Key Weakness in AI Reasoning

    The results point to a fundamental limitation of large language model AI systems. Although they can generate fluent and persuasive responses, they often struggle to reason through complicated questions. This can lead to answers that sound convincing but are actually incorrect, Cicek said.

    Why Experts Urge Caution With AI

    Based on these findings, the researchers recommend that business leaders verify AI-generated information and approach it with skepticism. They also emphasize the need for training to better understand what AI systems can and cannot do effectively.

    Although this study focused specifically on ChatGPT, Cicek noted that similar experiments with other AI tools have produced comparable outcomes. The work also builds on earlier research pointing to caution around AI hype. A 2024 national survey found that consumers were less likely to purchase products when they were marketed with a focus on AI.

    “Always be skeptical,” he said. “I’m not against AI. I’m using it. But you need to be very careful.”



    Source link

    Divya Sharma
    • Website

    Divya Sharma is a content writer at NewsPublicly.com, creating SEO-focused articles on travel, lifestyle, and digital trends.

    Related Posts

    JUPITER supercomputer breaks world record with 50-qubit quantum simulation

    May 12, 2026

    AI-powered robot learns how to harvest tomatoes more efficiently

    May 12, 2026

    AI uses as much energy as Iceland but scientists aren’t worried

    May 12, 2026
    Leave A Reply Cancel Reply

    Demo
    Top Posts

    “Inside Gemini Robotics 1.5: How Robots Learn to Reason & Act

    November 22, 202524 Views

    How US Tariffs Are Reshaping the Global Growth Landscape?

    November 21, 202518 Views

    Pakistani Journalist Laughing at Tejas Fighter Jet Crash at Dubai Airshow Sparks Massive Outrage Worldwide

    November 23, 202517 Views

    Vibe-Coding Boom: How Non-Coders Build Apps With AI Agents

    November 22, 202515 Views
    Don't Miss

    Thinking Machines wants to build an AI that actually listens while it talks

    May 12, 20262 Mins Read0 Views

    Thinking Machines Lab, the AI startup founded last year by former OpenAI CTO Mira Murati,…

    JUPITER supercomputer breaks world record with 50-qubit quantum simulation

    May 12, 2026

    Business News Today: Stock and Share Market News, Economy and Finance News, Sensex, Nifty, Global Market, NSE, BSE Live IPO News

    May 12, 2026

    Teen bedroom art installation shines spotlight on Ukraine’s stolen children | Ukraine

    May 12, 2026
    Stay In Touch
    • Facebook
    • Twitter
    • Instagram
    • YouTube
    • LinkedIn
    • WhatsApp

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    Demo
    NEWSPUBLICLY
    Facebook X (Twitter) Instagram LinkedIn

    Home

    • About Us
    • Leadership & Certification
    • Advertise & Partner With Us
    • Pitch Your Story
    • Media Kit & Pricing
    • Career
    • FAQs

    Guidelines

    • Editorial & Submission
    • Partnership
    • Advertising & Sponsor
    • Intellectual Property Policy
    • Community & Comment
    • Security & Data Protection
    • Send Your Opinion

    Quick Links

    • Cookie Policy
    • Payment & Billing Terms
    • Refund & Cancellation
    • Copyright Policy
    • Complaint & Support
    • Sitemap
    • Contact Us

    Subscribe Us

    Get the latest news and updates!

    Copyright © 2026 Newspublicly (DIGITALIX COMMUNICATION). All Rights Reserved.
    • Privacy Policy
    • Terms of Use
    • Disclaimer