This page contains press release content distributed by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

AI Built for Law Outperforms ChatGPT, Claude, and Gemini on Legal Reasoning Benchmark

DescrybeLM answered all 200 bar exam questions correctly. ChatGPT, Claude, and Gemini each missed between 13 and 23—and scored lower on legal reasoning quality.

BOSTON, MA, UNITED STATES, March 5, 2026 /EINPresswire.com/ — When AI gets a legal question wrong, the most dangerous failure isn’t an obvious error. It’s an answer that sounds authoritative: fluent, confident, well-structured, and yet applying the wrong legal standard. The error reads like competent lawyering.

Today, Descrybe launched DescrybeLM — an AI system built specifically for legal reasoning — and published a white paper with benchmark data to show what that difference looks like in practice.

Descrybe ran a controlled benchmark against ChatGPT 5.2, Claude Opus 4.5, and Gemini 3 Pro on 200 multistate bar exam questions. The study measured not just whether each system chose the correct answer, but whether the legal reasoning behind it was sound: Did it identify the right rule? Apply it correctly to the facts? Avoid the traps that produce persuasive but wrong analysis?

“We had a thesis that purpose-built legal AI produces meaningfully different results for legal reasoning tasks. Legal professionals deserve to make tool decisions based on real evidence. So we tested ourselves, published our methodology, and invite anyone to replicate it,” said Kara Peterson, Co-Founder and CEO of Descrybe.

What the benchmark showed

All four systems were tested under standardized, no-external-web conditions using the NCBE MBE Complete Practice Exam (Questions 1–200, no exclusions), producing 800 separate evaluation runs with blinded scoring.

When general-purpose models were wrong, they were confidently wrong. Among 52 incorrect outputs, 49 delivered assertive, well-structured reasoning that did not signal uncertainty — the failure mode that imposes the highest verification burden on practitioners. The dominant patterns were applying the wrong legal standard or misapplying the correct one, while the prose read like competent analysis.

Two models — Claude Opus 4.5 and Gemini 3 Pro — exhibited overconfident tone on correct outputs as well as incorrect ones. DescrybeLM and ChatGPT 5.2 received zero overconfidence flags across all 200 outputs. A system that sounds equally confident whether it is right or wrong gives practitioners no reliable signal from tone alone.

The study also found that cross-checking between general-purpose models is not a reliable substitute for getting the answer right. Across 200 questions, 40 were missed by at least one model, 11 by two or more, and only 1 by all three — meaning errors were largely unpredictable and non-overlapping.
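The overlap figures above can be checked against the 52 incorrect outputs reported earlier. The following is a minimal sketch of that arithmetic, not part of the published methodology; the function name `overlap_breakdown` and the dictionary keys are hypothetical and chosen only for illustration. It assumes the three counts are cumulative ("at least one", "two or more", "all three") as the sentence states.

```python
def overlap_breakdown(at_least_one: int, two_or_more: int, all_three: int) -> dict:
    """Split cumulative 'missed by at least N models' counts into exact buckets.

    Hypothetical helper for illustration; the inputs are the counts quoted in
    the release for the three general-purpose models.
    """
    exactly_one = at_least_one - two_or_more    # questions missed by one model only
    exactly_two = two_or_more - all_three       # questions missed by exactly two models
    exactly_three = all_three                   # questions missed by all three models

    # Each question missed by k models contributes k incorrect outputs.
    total_incorrect_outputs = exactly_one + 2 * exactly_two + 3 * exactly_three

    return {
        "exactly_one": exactly_one,
        "exactly_two": exactly_two,
        "exactly_three": exactly_three,
        "total_incorrect_outputs": total_incorrect_outputs,
        "share_missed_by_one_model_only": exactly_one / at_least_one,
    }


result = overlap_breakdown(at_least_one=40, two_or_more=11, all_three=1)
print(result)
```

Under these assumptions the buckets are 29, 10, and 1 questions, which sum to 52 incorrect outputs, matching the figure reported above. It also shows that 29 of the 40 missed questions (72.5%) were missed by a single model only, which is the sense in which the errors were largely non-overlapping.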

What’s behind the results

DescrybeLM is built on a curated primary-law corpus of more than 100 million structured records, requiring more than 100 billion tokens of preparation.

“Most AI tools are built for general use and adapted for law. DescrybeLM was built differently: from the foundation up, specifically for legal reasoning, on more than 100 million structured records individually cleaned and organized for that purpose. That kind of data work is painstaking and takes years — but it’s the difference between a system that sounds right and one that is right,” said Richard DiBona, Co-Founder and CTO of Descrybe.

Why this matters

The headline problem in legal AI isn’t systems that obviously fail. It’s systems that fail invisibly, confidently, and in a way that reads like competent analysis. In a crowded market, sounding right is easy to mistake for being right. Legal professionals need real evidence to decide which tools to use for which purposes — which is why Descrybe published its methodology and invites independent replication.

“It’s rare to see something that genuinely stops you in your tracks. When I saw DescrybeLM answer all 200 multistate bar exam questions correctly while ChatGPT, Claude, and Gemini each missed double digits — that’s not a marginal difference. That’s a different category of tool,” said Ken Friedman, legal technology pioneer and advisor to Descrybe.

The full white paper, Beyond Confidently Wrong: How Purpose-Built AI Mitigates Legal Reasoning’s Hidden Risk, is available now.

Kara Peterson
Descrybe
+1 617-752-2020

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed, please contact pressreleases@xpr.media
