• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia

Exclusive

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

An hour in the Oval Office with President Trump Fortune Editor-in-Chief: Alyson Shontell sat down with President Trump in the Oval Office for an hour. Tariffs, Intel, AI, Boeing, Iran—and the question every CEO eventually has to answer: who's next?

TechFuture of Work

Why Cloudera is saying ‘Goodbye, MapReduce’ and ‘Hello, Spark’

By
Derrick Harris
Derrick Harris
Down Arrow Button Icon
By
Derrick Harris
Derrick Harris
Down Arrow Button Icon
September 9, 2015, 7:08 AM ET
Hadoop allows some of the world's largest companies to store and process datasets on clusters of commodity hardware.
Hadoop allows some of the world's largest companies to store and process datasets on clusters of commodity hardware.Photograph by Jetta Productions/Getty Images

Cloudera, a company that helped popularize Hadoop as a platform for analyzing huge amounts of data when it was founded in 2008, is overhauling its core technology. The One Platform Initiative the company announced Wednesday lays out Cloudera’s plan to officially replace MapReduce with Apache Spark as the default processing engine for Hadoop.

Cloudera chief technologist Eli Collins said the company is “at best” halfway through the process from a technology standpoint and should be done in about a year. When complete, Spark should have similar levels of security, manageability, and scalability as MapReduce, and should be equally integrated with the rest of the technologies that comprise the ever-expanding Hadoop platform.

Collins said Spark’s existing weaknesses are “OK for early adopters, but really not acceptable to our customer base” as a whole. Cloudera says it has more than 100 customers running Spark in production—including Equifax, Experian, and CSC—but realizes that broader adoption and an improved Spark experience are a chicken-or-egg type of problem.

“If Spark is everywhere, then it’s a safe technology choice,” Collins explained. “And if it’s a safe technology choice, we can move the ecosystem.”

oneplatform
Cloudera

The history of the move to Spark is in some ways as old Hadoop itself. Google (GOOG) created MapReduce in the early 2000s as a faster, easier implementation of existing parallel processing approaches, and the creators of Hadoop developed an open source version of Google’s work. However, while MapReduce proved revolutionary for early big data workloads (nearly every major web company is a heavy Hadoop user), its limitations became more clear as Hadoop and big data became mainstream technology movements.

Large enterprises, technology startups and other potential Hadoop users saw the potential in storing lots of data using the Hadoop file system and in analyzing that data, but they wanted something faster and more flexible than MapReduce. It was designed for indexing the web at places like Google and Yahoo (YHOO), a batch-processing job where latency was measured in hours rather than milliseconds. MapReduce is also notoriously difficult to program, a problem that helped exacerbate the “big data skills gap” to which analyst firms and consultants have been pointing for years.

When Spark was created a few years ago at the University of California, Berkeley, it was the solution Hadoop vendors, Hadoop users, and venture capitalists alike needed to resolve their MapReduce woes. Spark is significantly faster and easier to program than MapReduce, meaning it can handle a much broader array of jobs. In fact, the project includes libraries for real-time data analysis, interactive SQL analysis, and machine learning, in addition to its core MapReduce-style engine.

And better yet, Spark is designed to integrate with Hadoop’s native file system. This means Hadoop users don’t have to move their terabytes or even petabytes of data elsewhere in order to take advantage of Spark. By 2013, major VC firms had began putting millions of dollars into Databricks, a startup founded by the creators of Spark, and major Hadoop vendors Cloudera, MapR, and Hortonworks (HDP) were beginning to integrate Spark into their Hadoop distributions.

spark
Databricks

“Spark is one of the few components where you’ve seen 100% adoption and 100% investment [from the Hadoop community],” Cloudera’ Collins said. Even Yahoo, which sponsored and drove Hadoop’s development early on and from which Hortonworks spun out, should be off of MapReduce within a year, he added. And IBM (IBM) announced in June a $300 million commitment to help develop Spark as the future of seemingly all analytic workloads.

That’s saying something in a technology market that has been characterized by corporate one-upmanship and flat-out insults over the years.

So now, Cloudera and the greater Hadoop community are trying to take the Spark transition over the finish line by making sure it works where MapReduce works and can handle as much (or nearly as much) data as MapReduce can. The latter is challenging—large Spark users might run it on hundreds of nodes, whereas large MapReduce users might run it on tens of thousands of nodes—but heavy Spark users and developers are working to close the gap.

“If we really want to replace MapReduce, we have to dot all those Is and cross all those Ts,” Collins said.

If the Hadoop community does it right, it could reap serious rewards as the technology, and “big data” in general, begins really catching on among mainstream businesses. “The vast majority of people who’ll use Hadoop in the next 10 years haven’t seen it yet,” Collins said. “…and Spark will be there [when they do.]”

For more about the business value of data analytics, watch this Fortune video:

Sign up for Data Sheet, Fortune’s daily newsletter about the business of technology.

 

About the Author
By Derrick Harris
See full bioRight Arrow Button Icon

Latest in Tech

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in Tech

altman
CommentarySam Altman
Musk vs. Altman: AI safety cannot be one man’s job
By Stavros GadinisMay 18, 2026
9 hours ago
Pope Leo launches an AI commission days before he releases a papal letter alongside Anthropic cofounder Christopher Olah
AIPope
Pope Leo launches an AI commission days before he releases a papal letter alongside Anthropic cofounder Christopher Olah
By Catherina GioinoMay 18, 2026
9 hours ago
John Ketchum, CEO of NextEra Energy, speaks during BlackRock's 2026 Infrastructure Summit in Washington, DC, on March 11, 2026. Photographer: Daniel Heuer/Bloomberg via Getty Images
EnergyNextEra Energy
NextEra’s $67 billion Dominion takeover creates the world’s largest utility—just in time to win the AI data-center power surge
By Jordan BlumMay 18, 2026
10 hours ago
Harvard University banners hang in front of a building
CryptoCryptocurrency
Harvard sold off its entire $87 million Ethereum stake just one quarter after buying it
By Jack KubinecMay 18, 2026
10 hours ago
Not the Allbirds effect: Japan’s top bidet maker Toto has been quietly making chip supplies for decades, and the stock market finally noticed
AIChips
Not the Allbirds effect: Japan’s top bidet maker Toto has been quietly making chip supplies for decades, and the stock market finally noticed
By Catherina GioinoMay 18, 2026
11 hours ago
monet
CybersecuritySocial Media
6.7 million people thought they were ripping apart an AI-generated Monet painting. But it was real
By Nick LichtenbergMay 18, 2026
11 hours ago

Most Popular

The Bezos family just donated $100 million to help achieve one of Mayor Zohran Mamdani’s top campaign promises
Politics
The Bezos family just donated $100 million to help achieve one of Mayor Zohran Mamdani’s top campaign promises
By Jake AngeloMay 12, 2026
6 days ago
While Trump insisted the Iran war would end ‘soon,’ an account in his name was buying millions in oil, defense and gold
Economy
While Trump insisted the Iran war would end ‘soon,’ an account in his name was buying millions in oil, defense and gold
By Eva RoytburgMay 18, 2026
12 hours ago
Microsoft AI chief gives it 18 months—for all white-collar work to be automated by AI
AI
Microsoft AI chief gives it 18 months—for all white-collar work to be automated by AI
By Jake AngeloMay 16, 2026
3 days ago
Current price of oil as of May 18, 2026
Personal Finance
Current price of oil as of May 18, 2026
By Joseph HostetlerMay 18, 2026
17 hours ago
EXCLUSIVE: An hour in the Oval Office with the CEO-in-Chief, President Trump
Politics
EXCLUSIVE: An hour in the Oval Office with the CEO-in-Chief, President Trump
By Alyson ShontellMay 18, 2026
23 hours ago
The top foreign holders of U.S. debt may soon dump Treasury bonds and bring their money back home, potentially spiking borrowing costs
Economy
The top foreign holders of U.S. debt may soon dump Treasury bonds and bring their money back home, potentially spiking borrowing costs
By Jason MaMay 17, 2026
2 days ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.