• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia

Exclusive

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

NewslettersCFO Daily

Is your chatbot hallucinating? A ‘bot debate’ could produce better A.I. answers, according to new research

Sheryl Estrada
By
Sheryl Estrada
Sheryl Estrada
Senior Writer and author of CFO Daily
Down Arrow Button Icon
Sheryl Estrada
By
Sheryl Estrada
Sheryl Estrada
Senior Writer and author of CFO Daily
Down Arrow Button Icon
May 31, 2023, 7:00 AM ET
Woman hand show phone with voice chat bot hologram, business network icons. Artificial intelligence, worldwide and binary. Concept of helpdesk
ismagilov—Getty Images

Good morning.

Recommended Video

CFOs are slow to embrace generative A.I. and the fact that a chatbot can hallucinate doesn’t help. 

Generative A.I. large language models (LLMs) that fuel chatbots are designed to understand and generate humanlike text. However, because they leverage billions of data points to predict the next word in a string of text, sometimes when not knowing the right answer to a prompt, they hallucinate, or create a response that may sound plausible but is factually incorrect or unrelated to the context. 

A group of MIT researchers released a new paper that finds a debate between chatbots can improve the reasoning and factual accuracy of LLMs. It’s like a bot debate club, except the bot can essentially debate iterations of itself. 

“The debate procedure allows a language model to critique and reflect on its opinions and opinions of other agents which allows it to sharpen its reasoning and answers,” Yilun Du, a researcher at MIT, and a coauthor of the paper, tells me. The researchers documented multiple instances of language models debating with each other over multiple rounds and reaching an improved shared answer.

How does this work? “The debates can occur in a single model (or bot),” says Du, who is a former researcher at OpenAI. “A single language model is replicated multiple times to generate multiple bots. Given a question, each bot then generates a different answer (the learned model behind the bot is the same across bots). The bots can then debate each other.”

However, the study also found that competing chatbots can spar with each other. “We also showed that you can have debates between different models like [OpenAI’s] ChatGPT and [Google’s] Bard to solve a task,” Du says. “But the majority of experiments use the same model.”

Michael Schrage, a research fellow at the MIT Sloan School Initiative on the Digital Economy, is not one of the authors of the paper but says he thinks the research is well done. “This kind of collective intelligence/voting approach is not uncommon,” Schrage says. “But to my knowledge, this is the first publication where I’ve seen it in a LLM context.”

Schrage has been exploring generative A.I. and LLMs with a focus on harnessing them as next-generation recommender systems. “I have already used large language models to generate business scenarios (some finance-related, others not) for both clients and classes,” he says. “I’ve found these scenarios constructive, provocative, and believable. But, again, these are LLMs, not large computational models.”

Foundational LLMs need to be fine-tuned and connected to software where calculations and computations are likely to be accurate, as well as transparent, explainable, and interpretable, he says. “That said, I think any financial analyst or auditor or accountant would be wildly irresponsible and unprofessional to rely on LLM-driven financial calculations at this time,” Schrage says.

He continues, “I strongly believe that—with guardrails and thoughtful, intentional prompts—FP&A folks and other financial modelers can get a lot of value very quickly by skillfully employing LLMs. The MIT research paper shows just how much is going on in the ‘computationally credible’ LLM space.”

Does Du think the issues with hallucinations or false information are valid concerns for finance professionals? “Yes,” he says. It’s very important to treat the responses from generative A.I. “not as ground truth, but rather just a possible source of information,” he says. Du suggests using responses as “ideas,” but then “separately verify yourself that they are correct.” He adds, “I believe my research is a step to making this source of information more accurate.” 

Let the debate begin.


Sheryl Estrada
sheryl.estrada@fortune.com

Big deal

A new report by Pew Research Center found that 58% of U.S. adults surveyed have heard of ChatGPT. Of that percentage, 19% say they have used it for entertainment, 14% have used it to learn something new, and 12% are currently working for pay and have used ChatGPT at work. Adults under 30 who have heard of ChatGPT are far more likely than those 65 and older to have used the chatbot for entertainment (31% vs. 4%). Pew also asked respondents about their experience with the chatbot. Fifteen percent say it has been extremely useful, and 20% say it's very useful. Meanwhile, 39% say it has been somewhat useful. The data is based on a survey of more than 10,000 U.S. adults conducted March 13-19, 2023.

Courtesy of Pew Research Center

Going deeper

"Rise of AI: Is Your Company Prepared for Generative AI?" is a new episode of the Wharton School's Ripple Effect podcast. Professor Rahul Kapoor explains why now is the time for business leaders to develop new frameworks to manage the changes ahead.

Leaderboard

Julia Brau Donnelly was named CFO at Pinterest, Inc. (NYSE: PINS), effective June 20. Donnelly will be taking on the role from Todd Morgenfeld. As previously announced, Morgenfeld will transition from Pinterest to pursue new career opportunities on July 1. Donnelly joins Pinterest from Wayfair, where she was most recently VP and global head of finance and accounting. During her more than seven-year tenure, she held several positions of increasing responsibility within the finance function. She led a global team of 250 employees across all of accounting and finance, including strategic finance, investor relations, corporate development, FP&A, accounting, tax and finance operations. Before Wayfair, she was a private equity investor in technology and media companies at Thomas H. Lee Partners in Boston.

Yafei (Roxi) Wen is resigning from her position of CFO at Invitae (NYSE: NVTA), a medical genetics company, effective June 30. Wen is pursuing other opportunities; the company has initiated a search for a new CFO. Wen will continue in her role through the end of the second quarter. Christine Gorjanc, longtime chair of the audit committee of the board of directors, will assume the role of interim CFO, effective July 1. Wen's resignation is not the result of any disagreement with the company on any matter related to operations, policies, or procedures, according to Invitae.

Overheard

"You're not going to fix these things if you are just sitting across the Pacific yelling at each other. So, I'm hoping we have real engagement."

—JPMorgan Chase CEO Jamie Dimon said on Wednesday during the JPMorgan Global China Summit in Shanghai, Reuters reported. Simon was answering a question about diplomatic relations between China and the U.S. and emphasized the need to have "real engagement" to resolve their security and trade matters. 

This is the web version of CFO Daily, a newsletter on the trends and individuals shaping corporate finance. Sign up to get CFO Daily delivered free to your inbox.

About the Author
Sheryl Estrada
By Sheryl EstradaSenior Writer and author of CFO Daily
LinkedIn iconTwitter icon

Sheryl Estrada is a senior writer at Fortune, where she covers the corporate finance industry, Wall Street, and corporate leadership. She also authors CFO Daily.

See full bioRight Arrow Button Icon

Latest in Newsletters

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in Newsletters

Women’s representation on boards of directors falls below 30%—but there’s one bright spot
NewslettersMPW Daily
Women’s representation on boards of directors falls below 30%—but there’s one bright spot
By Emma HinchliffeMay 18, 2026
13 minutes ago
US President Donald Trump speaks before signing an executive order in the Oval Office at the White House in Washington, DC, as Commerce Secretary Howard Lutnick looks on.
NewslettersCFO Daily
Trump’s new corporate playbook: Why the administration is taking equity stakes in companies like Intel
By Sheryl EstradaMay 18, 2026
5 hours ago
A panel on Gen Z workers sit alongside Fortune's Kristin Stoller at the Fortune Workplace Innovation Summit.
NewslettersFortune Workplace Innovation
AI in the workplace is stumbling. Fortune’s Workplace Innovation Summit will dive in to why
By Kristin StollerMay 18, 2026
5 hours ago
Wallet makers are the quiet backbone of the crypto industry. Now they want to be banks
NewslettersFortune Crypto
Wallet makers are the quiet backbone of the crypto industry. Now they want to be banks
By Jeff John RobertsMay 18, 2026
6 hours ago
Trump’s leadership model has a succession problem
C-SuiteNext to Lead
Trump’s leadership model has a succession problem
By Ruth UmohMay 18, 2026
7 hours ago
Inside Trump’s vision of America as a shareholder in U.S. companies: ‘I should have asked for more’
NewslettersCEO Daily
Inside Trump’s vision of America as a shareholder in U.S. companies: ‘I should have asked for more’
By Diane BradyMay 18, 2026
8 hours ago

Most Popular

Microsoft AI chief gives it 18 months—for all white-collar work to be automated by AI
AI
Microsoft AI chief gives it 18 months—for all white-collar work to be automated by AI
By Jake AngeloMay 16, 2026
2 days ago
The top foreign holders of U.S. debt may soon dump Treasury bonds and bring their money back home, potentially spiking borrowing costs
Economy
The top foreign holders of U.S. debt may soon dump Treasury bonds and bring their money back home, potentially spiking borrowing costs
By Jason MaMay 17, 2026
1 day ago
The Bezos family just donated $100 million to help achieve one of Mayor Zohran Mamdani’s top campaign promises
Politics
The Bezos family just donated $100 million to help achieve one of Mayor Zohran Mamdani’s top campaign promises
By Jake AngeloMay 12, 2026
6 days ago
'No one was coming to save me': How Reese Witherspoon built a $900 million company from a problem Hollywood wouldn't fix
Success
'No one was coming to save me': How Reese Witherspoon built a $900 million company from a problem Hollywood wouldn't fix
By Sydney LakeMay 17, 2026
1 day ago
SpaceX heads into a record-shattering IPO with the 'deepest moat that exists today' as investors vow to 'never bet against Elon'
Innovation
SpaceX heads into a record-shattering IPO with the 'deepest moat that exists today' as investors vow to 'never bet against Elon'
By Jason MaMay 16, 2026
2 days ago
Gen X is the most indebted generation in America. Their employers can fix that
Commentary
Gen X is the most indebted generation in America. Their employers can fix that
By Mary MorelandMay 17, 2026
1 day ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.