The Fort Worth Press - As AI data scrapers sap websites' revenues, some fight back

USD -
AED 3.6725
AFN 63.49826
ALL 81.649957
AMD 368.209891
ANG 1.790403
AOA 917.503082
ARS 1436.737304
AUD 1.426269
AWG 1.8
AZN 1.699145
BAM 1.685177
BBD 2.015096
BDT 122.817901
BGN 1.69088
BHD 0.377104
BIF 2991
BMD 1
BND 1.281762
BOB 6.938712
BRL 5.090801
BSD 1.000526
BTN 94.560525
BWP 13.406112
BYN 2.76997
BYR 19600
BZD 2.012252
CAD 1.414141
CDF 2320.000121
CHF 0.805425
CLF 0.022506
CLP 885.759871
CNY 6.75745
CNH 6.784495
COP 3435
CRC 455.716489
CUC 1
CUP 26.5
CVE 95.350078
CZK 20.80205
DJF 177.719866
DKK 6.43614
DOP 58.599944
DZD 132.878973
EGP 49.908197
ERN 15
ETB 158.375021
EUR 0.872821
FJD 2.2337
FKP 0.746465
GBP 0.757375
GEL 2.644999
GGP 0.746465
GHS 11.2977
GIP 0.746465
GMD 72.999684
GNF 8777.499016
GTQ 7.626359
GYD 209.290102
HKD 7.837295
HNL 26.697197
HRK 6.576702
HTG 130.666299
HUF 300.649642
IDR 17748.6
ILS 2.937345
IMP 0.746465
INR 94.309498
IQD 1310
IRR 1374999.999942
ISK 124.330031
JEP 0.746465
JMD 158.238482
JOD 0.709019
JPY 160.262999
KES 129.520178
KGS 87.449762
KHR 4012.493065
KMF 424.999812
KPW 900.00035
KRW 1511.864997
KWD 0.308098
KYD 0.8338
KZT 487.920041
LAK 22029.999804
LBP 89550.000054
LKR 335.185855
LRD 182.14983
LSL 16.194858
LTL 2.95274
LVL 0.60489
LYD 6.37502
MAD 9.245017
MDL 17.459223
MGA 4199.999949
MKD 53.086638
MMK 2099.945791
MNT 3579.382153
MOP 8.072446
MRU 40.080045
MUR 47.130241
MVR 15.460244
MWK 1736.000257
MXN 17.34925
MYR 4.064804
MZN 63.902105
NAD 16.201917
NGN 1359.119651
NIO 36.6101
NOK 9.74242
NPR 151.295881
NZD 1.73845
OMR 0.384498
PAB 1.000526
PEN 3.41251
PGK 4.38775
PHP 60.373009
PKR 278.298187
PLN 3.64767
PYG 6105.515298
QAR 3.640502
RON 4.507036
RSD 101.071054
RUB 72.971546
RWF 1488
SAR 3.751894
SBD 8.061424
SCR 14.115123
SDG 600.499323
SEK 9.59285
SGD 1.28203
SHP 0.746601
SLE 24.750291
SLL 20969.503664
SOS 571.507527
SRD 37.332026
STD 20697.981008
STN 21.4
SVC 8.754244
SYP 110.532098
SZL 16.19688
THB 32.534501
TJS 9.274765
TMT 3.51
TND 2.91175
TOP 2.40776
TRY 46.43713
TTD 6.796543
TWD 31.558502
TZS 2625.00297
UAH 44.808889
UGX 3701.565583
UYU 40.393596
UZS 12004.999858
VES 596.036397
VND 26326
VUV 118.988901
WST 2.739751
XAF 565.192704
XAG 0.015447
XAU 0.000239
XCD 2.70255
XCG 1.803205
XDR 0.703697
XOF 565.000179
XPF 103.250281
YER 238.625025
ZAR 16.450025
ZMK 9001.194926
ZMW 17.684109
ZWL 321.999592
  • CMSC

    0.0500

    22.37

    +0.22%

  • NGG

    -1.2400

    79.44

    -1.56%

  • RIO

    -2.5900

    100.08

    -2.59%

  • CMSD

    0.0000

    22.29

    0%

  • GSK

    -1.4800

    50.67

    -2.92%

  • BTI

    -0.5800

    58.91

    -0.98%

  • BP

    -1.0400

    39.1

    -2.66%

  • BCC

    3.8500

    74.66

    +5.16%

  • BCE

    0.0000

    23.28

    0%

  • RYCEF

    -0.0300

    18.4

    -0.16%

  • VOD

    -0.2300

    14.3

    -1.61%

  • JRI

    0.0500

    12.67

    +0.39%

  • RELX

    -0.8300

    31.18

    -2.66%

  • RBGPF

    -0.5300

    60.61

    -0.87%

  • AZN

    -2.9600

    174.93

    -1.69%

As AI data scrapers sap websites' revenues, some fight back
As AI data scrapers sap websites' revenues, some fight back / Photo: © AFP

As AI data scrapers sap websites' revenues, some fight back

A swarm of AI "crawlers" is running rampant on the internet, scouring billions of websites for data to feed algorithms at leading tech companies -- all without permission or payment, upending the online economy.

Text size:

Before the rise of AI chatbots, websites allowed search engines to access their content in return for increased visibility, a system that rewarded them with traffic and advertising revenues.

But the rapid development of generative AI has allowed tech giants like Google and OpenAI to harvest information for their chatbots with web crawlers, without humans ever needing to visit the original sites.

Traditional content producers, such as media outlets, are being outpaced by AI crawlers, which have cut into their online operations and advertising revenues.

"Sites that gave bots access to their content used to get readers in exchange," said Kurt Muehmel, head of AI strategy at data management firm Dataiku.

But the arrival of generative AI "completely breaks" that model, he told AFP.

Wikipedia's human internet traffic fell by eight percent between 2024 and 2025 because of a rise in AI search engine summaries, the online encyclopaedia reported last month.

"The fundamental tension is that the new business of the internet that is AI-driven doesn't generate traffic," said Matthew Prince, CEO of Cloudflare, an American internet services provider.

- 'No trespassing' -

Cloudflare, which processes more than 20 percent of all internet traffic, announced this summer a new measure aimed at blocking AI crawlers from accessing content without payment or permission from website owners.

"It's basically like putting a speed limit sign or a no trespassing sign," Prince told AFP on the sidelines of the Web Summit in Lisbon.

"Badly behaving bots can get by that, but we can track that... Over time, we can tighten these controls in a way that we're confident the AI companies can't get through."

The measure, which applies to more than 10 million websites, has already "attracted the attention of artificial intelligence giants", he added.

On a smaller scale, American startup TollBit is providing online news publishers with tools to block, monitor and monetise AI crawler traffic.

"The internet is a highway," said CEO and co-founder Toshit Panigrahi, who described the company as a "tollbooth on the internet".

TollBit works with more than 5,600 sites, including USA Today, Time magazine and the Associated Press, allowing media outlets to set their own access fees for their content.

The analytics are free for publishers, but AI companies are charged a "transaction fee for every piece of content they access".

But for Muehmel, the online takeover by AI crawlers cannot be resolved with only "partial measures or by an individual company".

"This is an evolution of the entire internet economy, which will take years," he said.

If the bot swarm continues to roam freely online, "all of the incentives for content creation are going to go away," Prince said.

"That would be a loss, not just for us humans that want to consume it, but actually for the AI companies that need original content in order to train their systems."

M.McCoy--TFWP