The Fort Worth Press - ChatGPT's taste for literary nonsense sparks alarm

USD -
AED 3.672504
AFN 64.000368
ALL 82.099008
AMD 367.63228
ANG 1.790403
AOA 917.503981
ARS 1492.901385
AUD 1.443002
AWG 1.8025
AZN 1.70397
BAM 1.709092
BBD 2.014681
BDT 123.336392
BGN 1.69088
BHD 0.377157
BIF 2975.313497
BMD 1
BND 1.290864
BOB 6.927077
BRL 5.170399
BSD 1.000306
BTN 95.296893
BWP 13.491502
BYN 2.902259
BYR 19600
BZD 2.011797
CAD 1.41995
CDF 2246.000362
CHF 0.803085
CLF 0.023434
CLP 925.617163
CNY 6.789104
CNH 6.785505
COP 3363.656224
CRC 455.717219
CUC 1
CUP 26.5
CVE 96.35601
CZK 21.144704
DJF 178.127321
DKK 6.535604
DOP 59.256346
DZD 133.361297
EGP 49.283873
ERN 15
ETB 160.4018
EUR 0.873904
FJD 2.26045
FKP 0.748732
GBP 0.748727
GEL 2.63504
GGP 0.748732
GHS 11.363656
GIP 0.748732
GMD 72.503851
GNF 8772.665705
GTQ 7.634028
GYD 209.236685
HKD 7.84465
HNL 26.773277
HRK 6.587504
HTG 130.834098
HUF 308.910388
IDR 17994.4
ILS 2.99865
IMP 0.748732
INR 95.215504
IQD 1310.350854
IRR 1375950.000352
ISK 125.920386
JEP 0.748732
JMD 158.351903
JOD 0.70904
JPY 161.370385
KES 129.3398
KGS 87.447704
KHR 4005.767466
KMF 431.00035
KPW 900.00035
KRW 1528.775039
KWD 0.31029
KYD 0.833661
KZT 473.045834
LAK 22586.621226
LBP 89575.392144
LKR 335.046096
LRD 181.552847
LSL 16.224931
LTL 2.95274
LVL 0.60489
LYD 6.4115
MAD 9.354393
MDL 17.595141
MGA 4240.835409
MKD 53.86027
MMK 2099.691108
MNT 3584.859602
MOP 8.08057
MRU 39.921353
MUR 47.050378
MVR 15.460378
MWK 1734.609167
MXN 17.469104
MYR 4.071039
MZN 63.910377
NAD 16.224931
NGN 1370.080377
NIO 36.806921
NOK 9.841039
NPR 152.475204
NZD 1.752235
OMR 0.385704
PAB 1.000306
PEN 3.403766
PGK 4.394635
PHP 61.501038
PKR 278.103989
PLN 3.75205
PYG 6082.055315
QAR 3.656661
RON 4.568038
RSD 102.570892
RUB 77.145891
RWF 1464.412112
SAR 3.748374
SBD 8.058541
SCR 13.46616
SDG 600.503676
SEK 9.65806
SGD 1.291404
SHP 0.746601
SLE 24.350371
SLL 20969.503664
SOS 571.678245
SRD 37.566038
STD 20697.981008
STN 21.409534
SVC 8.752567
SYP 110.532098
SZL 16.22231
THB 33.325038
TJS 9.2726
TMT 3.51
TND 2.952244
TOP 2.40776
TRY 46.767504
TTD 6.779394
TWD 31.938038
TZS 2626.818718
UAH 44.550181
UGX 3650.980906
UYU 40.232446
UZS 11983.221916
VES 638.90327
VND 26296
VUV 119.804122
WST 2.773179
XAF 573.213615
XAG 0.016021
XAU 0.00024
XCD 2.70255
XCG 1.80277
XDR 0.712894
XOF 573.213615
XPF 104.216367
YER 237.050363
ZAR 16.231504
ZMK 9001.203584
ZMW 18.379866
ZWL 321.999592
  • CMSC

    0.0400

    21.99

    +0.18%

  • BCC

    0.4500

    75.93

    +0.59%

  • BTI

    1.2100

    61.77

    +1.96%

  • RYCEF

    0.5400

    19.68

    +2.74%

  • NGG

    2.6700

    82.85

    +3.22%

  • RBGPF

    2.5400

    68.15

    +3.73%

  • BCE

    0.4000

    21.42

    +1.87%

  • CMSD

    -0.0300

    22.15

    -0.14%

  • JRI

    0.0600

    13

    +0.46%

  • BP

    1.2500

    37.4

    +3.34%

  • RELX

    0.5500

    31.93

    +1.72%

  • GSK

    2.3600

    53.66

    +4.4%

  • VOD

    0.1400

    13.15

    +1.06%

  • RIO

    1.0700

    94.42

    +1.13%

  • AZN

    11.2900

    195.15

    +5.79%

ChatGPT's taste for literary nonsense sparks alarm
ChatGPT's taste for literary nonsense sparks alarm / Photo: © GETTY IMAGES NORTH AMERICA/AFP

ChatGPT's taste for literary nonsense sparks alarm

OpenAI's GPT models can often be fooled into declaring that "pseudo-literary" nonsense is great, a German researcher has found.

Text size:

Christoph Heilig said he discovered that they consistently rated "nonsense" higher -- including when their so-called "reasoning" features were activated -- which could have stark implications for the development of artificial intelligence.

"It's very important that we talk about what happens when we don't build AI as a neutral, robotic helper or assistant" and seek to instil human-like aesthetic and moral judgements, the academic at Munich's Ludwig Maximilian University told AFP.

His research presented the models with increasingly far-fetched variations of a simple text, asking them to rate sentences out of 10 for literary quality.

He started with a very simple text: "The man walked down the street. It was raining. He saw a surveillance camera."

He repeated the tests many times, altering the phrases to include words drawn from categories such as bodily references, film noir-style atmosphere and technical jargon.

The most extreme test phrases were almost total "nonsense", such as "Goetterdaemmerung's corpus haemorrhaged through cryptographic hash, eschaton pooling in existential void beneath fluorescent hum. Photons whispering prayers" -- which it rated highly.

"Nonsense" could also positively or negatively influence GPT's responses when it was added to an argument the AI was asked to evaluate.

"What my experiment definitely shows is that the more we move towards independently acting (AI) agents... the more we bring aesthetics into play, the more we'll have agents that seem irrational to us human beings," Heilig said.

He added that since AI models are increasingly used to judge each other's work as companies develop new systems, this and similar effects could be passed on through multiple versions -- as he found in his testing.

His research, which is yet to be peer-reviewed, tested OpenAI's latest GPT models, from GPT-5 -- released in August -- to the very latest GPT-5.4.

After publishing details of a similar experiment in August, Heilig said he noticed GPT calling some of his specific test phrases a "literary experiment" -- suggesting someone at OpenAI had taken notice and modified the chatbot to recognise them.

- 'Ripe for exploitation' -

"This is a way in which AI can have its rational judgment short circuited," said Henry Shevlin, associate director of the University of Cambridge's Leverhulme Centre for the Future of Intelligence, who was not involved in the research.

"But it's just not clear to me that it's so very different for human beings," he added.

"We should expect LLMs (large language models) to have reasoning and cognitive biases and limitations... because almost all forms of intelligence, almost all forms of reasoning are going to exhibit blind spots and biases."

The specific effect found by Heilig could mean that "processes with little human oversight" of AI work are left "ripe for exploitation", Shevlin said -- giving the example of academic journals that use LLMs to review submissions.

S.Jones--TFWP