The Fort Worth Press - As AI data scrapers sap websites' revenues, some fight back


As AI data scrapers sap websites' revenues, some fight back / Photo: © AFP


A swarm of AI "crawlers" is running rampant on the internet, scouring billions of websites for data to feed algorithms at leading tech companies -- all without permission or payment, upending the online economy.


Before the rise of AI chatbots, websites allowed search engines to access their content in return for increased visibility, a system that rewarded them with traffic and advertising revenues.

But the rapid development of generative AI has allowed tech giants like Google and OpenAI to harvest information for their chatbots with web crawlers, without humans ever needing to visit the original sites.

Traditional content producers, such as media outlets, are being outpaced by AI crawlers, which have cut into their online operations and advertising revenues.

"Sites that gave bots access to their content used to get readers in exchange," said Kurt Muehmel, head of AI strategy at data management firm Dataiku.

But the arrival of generative AI "completely breaks" that model, he told AFP.

Wikipedia's human internet traffic fell by eight percent between 2024 and 2025 because of a rise in AI search engine summaries, the online encyclopaedia reported last month.

"The fundamental tension is that the new business of the internet that is AI-driven doesn't generate traffic," said Matthew Prince, CEO of Cloudflare, an American internet services provider.

- 'No trespassing' -

Cloudflare, which processes more than 20 percent of all internet traffic, announced this summer a new measure aimed at blocking AI crawlers from accessing content without payment or permission from website owners.

"It's basically like putting a speed limit sign or a no trespassing sign," Prince told AFP on the sidelines of the Web Summit in Lisbon.

"Badly behaving bots can get by that, but we can track that... Over time, we can tighten these controls in a way that we're confident the AI companies can't get through."

The measure, which applies to more than 10 million websites, has already "attracted the attention of artificial intelligence giants", he added.

On a smaller scale, American startup TollBit is providing online news publishers with tools to block, monitor and monetise AI crawler traffic.

"The internet is a highway," said CEO and co-founder Toshit Panigrahi, who described the company as a "tollbooth on the internet".

TollBit works with more than 5,600 sites, including USA Today, Time magazine and the Associated Press, allowing media outlets to set their own access fees for their content.

The analytics are free for publishers, but AI companies are charged a "transaction fee for every piece of content they access".

But for Muehmel, the online takeover by AI crawlers cannot be resolved with only "partial measures or by an individual company".

"This is an evolution of the entire internet economy, which will take years," he said.

If the bot swarm continues to roam freely online, "all of the incentives for content creation are going to go away," Prince said.

"That would be a loss, not just for us humans that want to consume it, but actually for the AI companies that need original content in order to train their systems."

M.McCoy--TFWP