Yandex Palekh Algorithm Catches The Long Tail With Machine Learning

Nov 3, 2016 - 8:22 am 2 by

Yandex Palekh

Yesterday, Yandex announced that they launched something similar to the Google RankBrain - well, they didn't say that, I am.

They launched what they call Palekh which is name of a Russian city, the flag of that city is of a firebird, which you can see in the image above. Why the firebird, well, it has a long tail and this algorithm aims at improving the quality of the results for long tail queries.

Yandex told us that they handle about 100 million queries per day fall under the "long-tail" classification within their search engine. That is about 40% of all the queries performed on that search engine.

So they wanted to make the results better by better understanding those queries. Yandex told me that basically," the technology allows us to understand the meaning behind every query, and not just look for similar words."

For that, we're starting to use neural networks as one of 1500 factors of ranking - we've managed to teach our neural networks to see the connections between a query and a document even if they don't contain common words. This has been made possible by converting the words from billions of search queries into numbers (with groups of 300 each) and putting them in 300-dimensional space - now every document has its own vector in that space. If the numbers of a query and numbers of a document are near each other in that space, then the result is relevant. This technology is called a "semantic vector".

They are using "billions of queries from logs and relying on documents' headlines and search queries, not documents' texts yet." "We also have many targets (long click prediction, CTR, "click or not click" models etc.) that are teaching our neural network - our research has showed that using more targets is more effective," they added. So this is a self learning, machine learning algorithm.

Yandex is a very very important search engine for Russian users.

Forum discussion at Twitter.

 

Popular Categories

The Pulse of the search community

Follow

Search Video Recaps

 
- YouTube
Video Details More Videos Subscribe to Videos

Most Recent Articles

Search Forum Recap

Daily Search Forum Recap: November 20, 2024

Nov 20, 2024 - 10:00 am
Google Search Engine Optimization

Google Site Reputation Abuse Policy Now Includes First Party Involvement Or Content Oversight

Nov 20, 2024 - 7:51 am
Google

Google Lens Updated For In-Store Shopping

Nov 20, 2024 - 7:41 am
Google Search Engine Optimization

Google Makes It Clear It Has Both Site Wide & Page Level Ranking Signals

Nov 20, 2024 - 7:31 am
Other Search Engines

ChatGPT's Search Marketing Share vs Google

Nov 20, 2024 - 7:21 am
Bing Search

Bing Video Search Tests Categorizing Videos

Nov 20, 2024 - 7:11 am
Previous Story: Webmasters React To Google's 200 Rankings Factors Claim While Googlers Look On