Recently, Baidu Baike – the Chinese equivalent of Wikipedia – updated its robots.txt file, which instructs search engines which web addresses they can access, and completely blocked Googlebot and Bingbot from indexing content from its platform.
Photo: Shutterstock
This move shows Baidu's efforts to protect its online assets amid the growing demand for big data to develop artificial intelligence (AI) models and applications.
Following Baidu Baike's robots.txt update, an SCMP survey revealed that many entries from the platform still appear in Google and Bing search results, possibly from previously archived content.
More than two years after OpenAI launched ChatGPT, many of the world's major AI developers are signing agreements with content publishers to access quality content for their GenAI projects.
OpenAI signed an agreement with Time magazine in June to access its entire archive spanning over 100 years.
Cao Phong (according to SCMP)
Source: https://www.congluan.vn/baidu-chan-google-va-bing-thu-thap-noi-dung-truc-tuyen-post309081.html






Comment (0)