Close Menu

    Subscribe to Updates

    Get the latest news information from worldwide businesses.

    What's Hot

    2026 F1 Monaco GP Qualifying: Winners and Losers

    June 6, 2026

    When public charging stations aren’t so public, and why it matters

    June 6, 2026

    Kerala Pharma Trade Raises Alarm Over Cross-Border Discount Racket, Fake Medicine Threat

    June 6, 2026
    Facebook Instagram YouTube LinkedIn X (Twitter)
    Trending
    • 2026 F1 Monaco GP Qualifying: Winners and Losers
    • When public charging stations aren’t so public, and why it matters
    • Kerala Pharma Trade Raises Alarm Over Cross-Border Discount Racket, Fake Medicine Threat
    • Meta made its own AI-generated clickbait news feed
    • Maruti, Tata lead hatchback revival as India’s carmakers rediscover the mass market
    • Ahead of INDIA bloc meet, CPM asks Congress to ‘clear air’ on Kerala poll remarks on ‘deal’ with BJP | India News
    • Starting Order & Pole for DQS Solutions & Staffing 250
    • WB JELET 2026 Admit Cards Released for Pharmacy Lateral Entry Admissions, Exam on June 13
    Newspublicly
    • About Us
    • Advertise & Partner with us
    • Pitch Your Story
    • Contact Us
    Facebook Instagram LinkedIn X (Twitter)
    Subscribe
    • Home
    • World News
      • Asia
      • India
      • USA
      • UK & Europe
      • Middle East
    • Economy & Business
      • Global Economy
      • Corporate & Industry
      • Finance & Markets
      • Policy & Trade
    • Technology
      • Gadgets & Devices
      • Software & Apps
      • AI & Machine Learning
      • Robotics & Automation
    • Health & Medicine
      • Fitness & Nutrition
      • Research & Innovation
      • Disease & Treatment
      • Doctors, Clinics & Patient Care
    • Travel & Tourism
    • Automobile
      • Electric & Hybrid Vehicles
      • Auto Industry Insights
    • Sports
    • More
      • Education
      • Real Estate
      • Environment & Climate
      • Space & Astronomy
      • War & Conflicts
    Newspublicly
    Home»Technology»Software & Apps»New Microsoft tool lets devs spin up AI behavior tests using text descriptions
    Software & Apps

    New Microsoft tool lets devs spin up AI behavior tests using text descriptions

    AdminBy AdminJune 2, 2026No Comments3 Mins Read0 Views
    Share
    Facebook Twitter LinkedIn Copy Link WhatsApp


    AI researchers and labs have advanced by leaps and bounds in evaluating AI models for everything from safety and compliance to sycophancy and alignment. But it appears companies and developers are faced with a new, specific need: making sure their AI system behaves as intended for their specific product or service.

    In a bid to make that testing process simpler, Microsoft on Tuesday took the wraps off ASSERT, short for Adaptive Spec-driven Scoring for Evaluation and Regression Testing.

    The open source framework, Microsoft says, makes evaluating application-specific AI behavior easy by using AI to turn high-level, natural-language descriptions of goals, policies, or intended behaviors into thorough, scored tests that can be investigated.

    ASSERT takes plain-language descriptions of an AI model’s expected behavior and policies, turns them into a structured set of acceptable and unacceptable behaviors, generates problem scenarios and test cases, runs them against the target system, and scores the results. It can also record the paths the AI system takes, including intermediate actions and tool calls, so developers can inspect where failures happen.

    Devs can provide system context, tools, and constraints, too, if they want to further customize what the evaluations cover.

    For example, a developer could specify that a document research AI agent shouldn’t send emails to people outside the company, and it should limit confidential information to C-level executives and provide concise summaries with prior context in mind. ASSERT will use those rules to generate test cases that check whether the system follows those rules on an ongoing basis.

    Image Credits:Microsoft

    The framework, according to Microsoft, fills a gap that broader, more general evaluations cannot when AI models are intended to behave in a manner that is shaped by an application or product’s context, policies, and tools.

    “One of the things we’ve learned is that evaluations are absolutely critical to making good decisions,” said Sarah Bird, chief product officer of Responsible AI at Microsoft. “Because if you don’t understand the behavior of the AI system, it’s really hard to know if it’s meeting your organization’s bar … What we found is that if you really want to have a trustworthy system, you should evaluate many more dimensions that are application-specific.”

    Bird said ASSERT can be used to evaluate systems when they’re being built, after deployment, and even for continuous monitoring.

    The release comes amidst a gradual but broader shift in the AI industry. As models grow more capable, researchers are focusing on repeatable testing and regression checks, with Stanford’s HELM, MLCommons’ AILuminate, and evaluation groups like METR rolling out benchmarks to measure how models behave under different conditions.

    When you purchase through links in our articles, we may earn a small commission. This doesn’t affect our editorial independence.



    Source link

    Author

    • Admin

      NewsPublicly.com is News & Articles Platform that creating SEO-focused articles on travel, lifestyle, and digital trends.

    Admin
    • Website

    NewsPublicly.com is News & Articles Platform that creating SEO-focused articles on travel, lifestyle, and digital trends.

    Related Posts

    OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

    June 6, 2026

    Sriram Krishnan is leaving his role as White House AI advisor

    June 6, 2026

    What to expect from WWDC 2026: Siri’s highly anticipated revamp and Apple Intelligence updates

    June 6, 2026
    Leave A Reply Cancel Reply

    Demo
    Top Posts

    The Blue Moon rises on May 30— Where and when to see the second full moon of the month

    May 30, 202640 Views

    New SOCOM rifle allows barrel swapping and cartridge changes

    June 1, 202632 Views

    “Inside Gemini Robotics 1.5: How Robots Learn to Reason & Act

    November 22, 202525 Views

    525 pounds of cocaine seized after Nebraska K9 alerts troopers on I-80

    May 28, 202624 Views
    Don't Miss

    2026 F1 Monaco GP Qualifying: Winners and Losers

    June 6, 20264 Mins Read0 Views

    The F1 Monaco GP qualifying saw Kimi Antonelli pick up his first pole position around…

    When public charging stations aren’t so public, and why it matters

    June 6, 2026

    Kerala Pharma Trade Raises Alarm Over Cross-Border Discount Racket, Fake Medicine Threat

    June 6, 2026

    Meta made its own AI-generated clickbait news feed

    June 6, 2026
    Stay In Touch
    • Facebook
    • Twitter
    • Instagram
    • YouTube
    • LinkedIn
    • WhatsApp

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    Demo
    NEWSPUBLICLY
    Facebook X (Twitter) Instagram LinkedIn

    Home

    • About Us
    • Leadership
    • Advertise & Partner With Us
    • Pitch Your Story
    • Media Kit & Pricing
    • Career
    • FAQs

    Guidelines

    • Editorial & Submission
    • Partnership
    • Advertising & Sponsor
    • Intellectual Property Policy
    • Community & Comment
    • Security & Data Protection
    • Send Your Opinion

    Quick Links

    • Cookie Policy
    • Payment & Billing Terms
    • Refund & Cancellation
    • Copyright Policy
    • Complaint & Support
    • Sitemap
    • Contact Us

    Subscribe Us

    Get the latest news and updates!

    Copyright © 2026 Newspublicly (DIGITALIX COMMUNICATION). All Rights Reserved.
    • Privacy Policy
    • Terms of Use
    • Disclaimer