The Fort Worth Press - As AI data scrapers sap websites' revenues, some fight back

USD -
AED 3.672494
AFN 64.000493
ALL 81.450493
AMD 370.780251
ANG 1.789884
AOA 917.999881
ARS 1392.559404
AUD 1.38748
AWG 1.8
AZN 1.695216
BAM 1.669697
BBD 2.01454
BDT 122.725158
BGN 1.668102
BHD 0.37765
BIF 2976
BMD 1
BND 1.275896
BOB 6.911331
BRL 4.954702
BSD 1.000226
BTN 94.881811
BWP 13.592996
BYN 2.822528
BYR 19600
BZD 2.011629
CAD 1.35921
CDF 2319.999847
CHF 0.780701
CLF 0.022861
CLP 899.749905
CNY 6.82825
CNH 6.816975
COP 3657.25
CRC 454.73562
CUC 1
CUP 26.5
CVE 94.449942
CZK 20.76365
DJF 177.719703
DKK 6.36849
DOP 59.49346
DZD 132.464709
EGP 53.495099
ERN 15
ETB 156.999734
EUR 0.85227
FJD 2.190603
FKP 0.736618
GBP 0.735645
GEL 2.679571
GGP 0.736618
GHS 11.202571
GIP 0.736618
GMD 72.99985
GNF 8774.999794
GTQ 7.641507
GYD 209.25239
HKD 7.833965
HNL 26.619786
HRK 6.4231
HTG 131.024649
HUF 308.5225
IDR 17376
ILS 2.94745
IMP 0.736618
INR 94.92485
IQD 1310
IRR 1313999.999982
ISK 122.559434
JEP 0.736618
JMD 156.725146
JOD 0.708968
JPY 156.774502
KES 129.095472
KGS 87.420496
KHR 4012.502072
KMF 420.000157
KPW 899.999976
KRW 1468.440084
KWD 0.307899
KYD 0.833543
KZT 463.288124
LAK 21979.999983
LBP 89550.000285
LKR 319.671116
LRD 183.875001
LSL 16.659854
LTL 2.95274
LVL 0.604891
LYD 6.349683
MAD 9.251249
MDL 17.233504
MGA 4150.000427
MKD 52.539606
MMK 2099.490131
MNT 3577.850535
MOP 8.070846
MRU 39.969687
MUR 46.76048
MVR 15.455009
MWK 1741.552774
MXN 17.429855
MYR 3.952497
MZN 63.895715
NAD 16.660055
NGN 1375.980277
NIO 36.71013
NOK 9.27605
NPR 151.803598
NZD 1.689805
OMR 0.384489
PAB 1.000201
PEN 3.507503
PGK 4.33875
PHP 61.469602
PKR 278.77498
PLN 3.61942
PYG 6151.626275
QAR 3.643499
RON 4.429904
RSD 99.996991
RUB 75.001641
RWF 1461.5
SAR 3.74998
SBD 8.04211
SCR 14.88162
SDG 600.499176
SEK 9.213799
SGD 1.27268
SHP 0.746601
SLE 24.599275
SLL 20969.496166
SOS 571.000167
SRD 37.457968
STD 20697.981008
STN 21.21
SVC 8.7523
SYP 110.524981
SZL 16.659994
THB 32.417043
TJS 9.381822
TMT 3.505
TND 2.88175
TOP 2.40776
TRY 45.19573
TTD 6.789386
TWD 31.590949
TZS 2610.000207
UAH 43.949336
UGX 3760.987334
UYU 39.889518
UZS 11949.999996
VES 488.942755
VND 26338.5
VUV 117.651389
WST 2.715189
XAF 560.041494
XAG 0.013321
XAU 0.000218
XCD 2.70255
XCG 1.80265
XDR 0.69563
XOF 559.99986
XPF 102.15034
YER 238.600947
ZAR 16.58375
ZMK 9001.195339
ZMW 18.67895
ZWL 321.999592
  • GSK

    -0.7000

    51.61

    -1.36%

  • CMSD

    0.1500

    23.28

    +0.64%

  • BCC

    -1.1400

    78.13

    -1.46%

  • BCE

    0.1800

    23.96

    +0.75%

  • RIO

    0.1000

    100.58

    +0.1%

  • JRI

    -0.0100

    12.98

    -0.08%

  • BP

    -0.9700

    46.41

    -2.09%

  • CMSC

    0.0600

    22.88

    +0.26%

  • BTI

    -0.0900

    58.71

    -0.15%

  • NGG

    -1.0600

    88.48

    -1.2%

  • RBGPF

    0.5000

    63.1

    +0.79%

  • AZN

    -2.6300

    184.74

    -1.42%

  • RELX

    -0.2400

    36.35

    -0.66%

  • RYCEF

    0.5500

    16.35

    +3.36%

  • VOD

    0.3500

    16.15

    +2.17%

As AI data scrapers sap websites' revenues, some fight back
As AI data scrapers sap websites' revenues, some fight back / Photo: © AFP

As AI data scrapers sap websites' revenues, some fight back

A swarm of AI "crawlers" is running rampant on the internet, scouring billions of websites for data to feed algorithms at leading tech companies -- all without permission or payment, upending the online economy.

Text size:

Before the rise of AI chatbots, websites allowed search engines to access their content in return for increased visibility, a system that rewarded them with traffic and advertising revenues.

But the rapid development of generative AI has allowed tech giants like Google and OpenAI to harvest information for their chatbots with web crawlers, without humans ever needing to visit the original sites.

Traditional content producers, such as media outlets, are being outpaced by AI crawlers, which have cut into their online operations and advertising revenues.

"Sites that gave bots access to their content used to get readers in exchange," said Kurt Muehmel, head of AI strategy at data management firm Dataiku.

But the arrival of generative AI "completely breaks" that model, he told AFP.

Wikipedia's human internet traffic fell by eight percent between 2024 and 2025 because of a rise in AI search engine summaries, the online encyclopaedia reported last month.

"The fundamental tension is that the new business of the internet that is AI-driven doesn't generate traffic," said Matthew Prince, CEO of Cloudflare, an American internet services provider.

- 'No trespassing' -

Cloudflare, which processes more than 20 percent of all internet traffic, announced this summer a new measure aimed at blocking AI crawlers from accessing content without payment or permission from website owners.

"It's basically like putting a speed limit sign or a no trespassing sign," Prince told AFP on the sidelines of the Web Summit in Lisbon.

"Badly behaving bots can get by that, but we can track that... Over time, we can tighten these controls in a way that we're confident the AI companies can't get through."

The measure, which applies to more than 10 million websites, has already "attracted the attention of artificial intelligence giants", he added.

On a smaller scale, American startup TollBit is providing online news publishers with tools to block, monitor and monetise AI crawler traffic.

"The internet is a highway," said CEO and co-founder Toshit Panigrahi, who described the company as a "tollbooth on the internet".

TollBit works with more than 5,600 sites, including USA Today, Time magazine and the Associated Press, allowing media outlets to set their own access fees for their content.

The analytics are free for publishers, but AI companies are charged a "transaction fee for every piece of content they access".

But for Muehmel, the online takeover by AI crawlers cannot be resolved with only "partial measures or by an individual company".

"This is an evolution of the entire internet economy, which will take years," he said.

If the bot swarm continues to roam freely online, "all of the incentives for content creation are going to go away," Prince said.

"That would be a loss, not just for us humans that want to consume it, but actually for the AI companies that need original content in order to train their systems."

M.McCoy--TFWP