Cloudflare has launched a free tool designed to block AI bot scrapers, preventing companies from harvesting website content to train large language models without permission from site owners. The cloud services provider is making the tool available to its entire customer base, including those on free plans. The company says the feature will automatically stay current over time as it identifies new fingerprints of bots it deems to be widely scraping the web for model training.
In announcing the update, Cloudflare said its customers are contending with a surge of bot traffic that scrapes online content to train generative AI models. According to the company’s data, 85.2% of customers have chosen to block AI bots, even ones that accurately identify themselves when requesting access to their sites.
Cloudflare also named the most active AI bots of the past year. Bytespider, operated by ByteDance, attempted to access roughly 40% of websites under Cloudflare’s protection, while OpenAI’s GPTBot crawled around 35%. The two ranked among the top four AI bot crawlers by request volume on Cloudflare’s network, alongside Amazonbot and ClaudeBot.
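The crudest way a site can act on those names is to filter on the User-Agent string that well-behaved crawlers send with each request. The Python sketch below is purely illustrative, not Cloudflare’s implementation; the crawler tokens are the bots named above, and `handle_request` is a hypothetical handler invented for the example.

```python
# Minimal sketch of User-Agent-based crawler blocking (illustrative only;
# not Cloudflare's implementation). Tokens are the crawlers named in this article.
AI_CRAWLER_TOKENS = (
    "Bytespider",  # ByteDance
    "GPTBot",      # OpenAI
    "ClaudeBot",   # Anthropic
    "Amazonbot",   # Amazon
)

def is_ai_crawler(user_agent: str) -> bool:
    """Return True if the User-Agent contains a known AI crawler token."""
    ua = (user_agent or "").lower()
    return any(token.lower() in ua for token in AI_CRAWLER_TOKENS)

def handle_request(headers: dict) -> tuple[int, str]:
    """Toy request handler: refuse self-identified AI crawlers with a 403."""
    if is_ai_crawler(headers.get("User-Agent", "")):
        return 403, "AI crawlers are not permitted on this site."
    return 200, "OK"

if __name__ == "__main__":
    print(handle_request({"User-Agent": "Mozilla/5.0 (compatible; Bytespider)"}))
    print(handle_request({"User-Agent": "Mozilla/5.0 (Windows NT 10.0; rv:126.0)"}))
```

Because this check trusts whatever identity the client claims, a scraper that spoofs a browser User-Agent passes straight through; that gap is why Cloudflare relies on behavioral fingerprints rather than self-reported names.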
Keeping AI bots away from content remains an ongoing challenge. As the race to build and ship AI models has intensified, some companies have been found circumventing or outright ignoring established anti-scraping conventions such as robots.txt. Scraping websites without explicit permission from their owners raises serious legal and ethical concerns. But if Cloudflare, one of the most prominent cloud security and performance providers, makes a concerted effort to curb that behavior, it could have a real impact.
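For reference, robots.txt is the main convention at issue: a plain-text file served from a site’s root that asks crawlers to stay out of some or all paths. A minimal example covering the crawlers named in this article might look like the sketch below; compliance is entirely voluntary, which is exactly why ignoring it is so easy.

```
# robots.txt — a request, not an enforcement mechanism
User-agent: Bytespider
Disallow: /

User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: Amazonbot
Disallow: /
```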
“We fear that some AI companies intent on circumventing rules to access content will persistently adapt to evade bot detection,” the company stated. “We will continue to keep watch, add more bot blocks to our AI Scrapers and Crawlers rule, and evolve our machine learning models to help keep the Internet a place where content creators retain control over which models their content is used to train or run inference on.”