Verizon Reveals the Secrets of Yahoo Search

Three months after acquiring Yahoo, Verizon is giving away the secrets of a key Yahoo search tool. Today, Oath, the Verizon-owned company born of the merger between AOL and Yahoo, released the source code of a data-crunching tool called Vespa, which has long powered many features across the Yahoo empire.1 Now that it’s open source, any company or individual can use or modify Vespa to power its own products or websites.

Open sourcing search technology might sound a little quaint, given that these days Yahoo actually uses Microsoft’s Bing to power most of its web searches. But Vespa underlies searches within Yahoo, on sites like Flickr, which hosts millions of images. Yahoo also uses Vespa to power related-article recommendations and ad-targeting on many Yahoo-branded sites, including Yahoo News, Yahoo Sports, Yahoo Finance, and its advertising network. Oath systems architect Jon Bratseth says Vespa processes billions of requests per day.

Vespa’s history traces back to the Norwegian search engine AllTheWeb, which Yahoo acquired in 2003. After the acquisition, the AllTheWeb team started retooling its search technology into a more general-purpose tool that Yahoo developers could use internally to power different applications. The code has been almost completely rewritten since those early days.

Oath VP of engineering for big data Peter Cnudde says that by making Vespa open source, the company hopes to replicate the benefits it has reaped from supporting Hadoop, an open source software framework for managing big data. Yahoo hired Hadoop co-creator Doug Cutting in 2006, and paid other engineers to work on it as well. Eventually, Hadoop was adopted by the likes of Facebook, Twitter, eBay, and many others, whose employees added features and fixed bugs. As more people used Hadoop, it became easier for Yahoo to recruit people who were already familiar with the software. Cnudde says Oath hopes Vespa will follow the same path.

Hadoop isn’t as good as Vespa for returning real-time results. And many real-time processing tools, such as Apache Storm, aren’t designed to serve results to end users. So Oath uses Vespa, Hadoop, and Storm together. Until now, Vespa hasn’t been available to developers outside of Oath, Yahoo, and Yahoo Japan.

“We would have loved to do it earlier,” says Cnudde. “But open source doesn’t come for free. You have to write the documentation, make sure it’s acceptable, and be ready to manage a community.”

It’s unclear whether there’s demand for Vespa outside of Oath. Hadoop was born open source, and came along just as companies needed it. But most large-scale internet companies have already solved the web-search problems that Vespa was designed to address. Plus, there are several open source search engines available, including Solr and Elasticsearch. And let’s face it: the Yahoo brand has seen better days. But for new and growing companies, Vespa might just fill an important niche.

1 Correction appended 7:05 pm ET: Vespa powers search and other features of Yahoo’s network of sites. An earlier version of this story incorrectly implied that Vespa previously powered Yahoo web-search features that now are handled by Bing.