Google and Reddit have reached an agreement that gives Google real-time access to all of Reddit’s content. This will allow Google to display even more Reddit content than it already does, and to use that content to train its models.
The agreement makes clear that Reddit discussions will get more exposure across a range of Google products, which may include a variety of search surfaces.
Google, Reddit and AI
Access to a broader range of Reddit content, delivered in a structured format, should improve the ability of language models to understand human conversation and writing styles. Because artificial intelligence is increasingly used in search, this could affect how content is understood and ranked in Google search.
Reddit, for its part, will use Google’s Vertex AI platform to improve its own search and to build other “capabilities” on the platform. The arrangement is reportedly worth $60 million a year to Reddit.
Reddit has become such a popular place to discuss virtually any subject that searchers looking for information add the word “Reddit” to their queries in order to surface content directly from Reddit and avoid the rest of Google’s search results altogether.
Because Reddit is a very popular social networking website, its conversations are a deep source of conversational data written in many styles and covering a wide range of topics, which makes them especially well suited to training large language models.
Structured Content
Most content found on web pages is what is called “unstructured data.” Machines have to process unstructured data to strip out extraneous components, such as navigation, and extract the main content. In Reddit’s case, they also have to make sense of which content was upvoted and which was downvoted.
Structured data, on the other hand, has already been sorted into its component pieces, which removes any ambiguity about what each piece of data is.
Google now has access to all of Reddit’s data in real time and in a structured format, which will make it easier for Google to make sense of the information and use it more effectively. Additionally, thanks to what Google refers to as “enhanced signals,” Google will be able to display the data in ways that are more helpful to users.
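To make the difference between the two kinds of data concrete, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the HTML markup, the `data-score` attribute, the JSON field names `body` and `score`, and the helper names are all hypothetical and do not reflect Reddit’s actual pages or the format of the data Google receives.

```python
import json
from html.parser import HTMLParser

# Hypothetical page markup (an assumption); real pages are far messier.
RAW_HTML = """
<html><body>
<nav><a href="/">Home</a> <a href="/popular">Popular</a></nav>
<div class="comment" data-score="42"><p>Try the trail at sunrise.</p></div>
<div class="comment" data-score="-3"><p>Skip it, too crowded.</p></div>
<footer>About | Help</footer>
</body></html>
"""

# Hypothetical structured payload (an assumption): the same two comments,
# already split into labelled fields.
STRUCTURED_JSON = """
[{"body": "Try the trail at sunrise.", "score": 42},
 {"body": "Skip it, too crowded.", "score": -3}]
"""


class CommentExtractor(HTMLParser):
    """Collects comment text and scores, ignoring navigation and footer."""

    def __init__(self):
        super().__init__()
        self.comments = []
        self._current = None  # comment being read, or None outside a comment

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        # The parser has to guess which markup marks a comment.
        if tag == "div" and attrs.get("class") == "comment":
            self._current = {"body": "", "score": int(attrs.get("data-score", 0))}

    def handle_endtag(self, tag):
        if tag == "div" and self._current is not None:
            self._current["body"] = self._current["body"].strip()
            self.comments.append(self._current)
            self._current = None

    def handle_data(self, data):
        # Text outside a comment (navigation, footer) is discarded.
        if self._current is not None:
            self._current["body"] += data


def from_unstructured(html):
    """Recover comments from raw markup by stripping everything else."""
    parser = CommentExtractor()
    parser.feed(html)
    parser.close()
    return parser.comments


def from_structured(payload):
    """Structured data needs no cleanup: each field is already labelled."""
    return json.loads(payload)


unstructured = from_unstructured(RAW_HTML)
structured = from_structured(STRUCTURED_JSON)
```

Both paths end up with the same list of comments and scores, but the first only works as long as the guesses about the markup hold, while the second is a single parsing call with nothing to strip out or infer.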
The announcements from both Google and Reddit state that one of Google’s goals is to display more content from Reddit.
“Over the years, we’ve seen that people increasingly use Google to search for helpful content on Reddit to find product recommendations, travel advice and much more. We know people find this information useful, so we’re developing ways to make it even easier to access across Google products. This partnership will facilitate more content-forward displays of Reddit information that will make our products more helpful for our users and make it easier to participate in Reddit communities and conversations.”