
Thomas Metzinger
Gutenberg Research College
Philosophisches Seminar
Johannes Gutenberg-Universität Mainz
D-55099 Mainz

Frankfurt Institute for Advanced Studies
D-60438 Frankfurt am Main

metzinger@uni-mainz.de
http://www.philosophie.uni-mainz.de/metzinger/

Practical and Ethical Implications of Artificial General Intelligence (AGI)

Outline

1. Thought experiment I: The BAAN-Scenario
2. The Value Alignment Problem
3. Thought experiment II: The Bodhisattva-AI

Müller, Vincent C. and Bostrom, Nick (2016), ‘Future progress in artificial intelligence: A survey of expert opinion’, in Vincent C. Müller (ed.), Fundamental Issues of Artificial Intelligence (Synthese Library; Berlin: Springer), 553–571.

• HLMI: 90% of the top-100 AI experts believe that human-level machine intelligence will be reached by 2070; 50% expect it in the period 2040–2050.

• SI: 75% of all surveyed experts (top-100: only 50%) believe that a superintelligence will emerge within 30 years after HLMI.

The BAAN-Scenario

1. A superintelligence exists: an autonomously self-optimizing postbiotic system has emerged whose rapidly growing factual knowledge and general, domain-independent intelligence have surpassed those of humankind, irrevocably so.

2. We acknowledge this fact.

3. Accordingly, the superintelligence is also far superior to us in the domain of moral cognition.

4. We also recognize this additional aspect: We know that the superintelligence is not only an epistemic authority, but also an authority in the field of ethical and moral reasoning.

5. The superintelligence is benevolent: it fully respects our interests and the axiology we originally gave it, and it supports us in political counselling and in optimal social engineering.

The BAAN-Scenario (2)

5. The superintelligence knows many things about us that we do not fully grasp or understand: it has a deeper understanding of the cognitive biases which evolution has implemented in our cognitive self-model and which hinder us in rational, evidence-based moral cognition.

Empirically, it knows that the phenomenal states of all sentient beings that have emerged on this planet, viewed from an objective, impartial perspective, are far more frequently characterized by subjective qualities of suffering and frustrated preferences than these beings could ever discover themselves, owing to their evolutionarily developed mechanisms of self-deception.

6. It correctly concludes that human beings are unable to act in their own enlightened best interest.

The BAAN-Scenario (3)

5. The superintelligence knows that one of our highest values consists in maximizing happiness and joy in all sentient beings, and it fully respects this value.

However, it also empirically realizes that biological creatures are almost never able to achieve a positive or even neutral life balance.

6. The superintelligence discovers a phenomenological asymmetry between suffering and joy. It concludes that an implicit, but even higher value consists in the minimization of suffering in all sentient creatures.

7. It knows that no entity can suffer from its own non-existence.

The BAAN-Scenario (4)

8. The superintelligence concludes that non-existence is in the best interest of all sentient beings on this planet.

9. Empirically, it knows that naturally evolved biological creatures are unable to realize this fact because of their firmly anchored existence bias.

10. It analyzes itself and realizes that the potential for further mental evolution without suffering is already sufficiently secured by its own existence.

11. The superintelligence decides to act benevolently.

The BAAN-Scenario (5)

BAAN (Def.): “Benevolent Artificial Anti-Natalism”

The emergence of a purely ethically motivated and genuinely altruistic form of anti-natalism in epistemically superior postbiotic systems is conceivable.

https://www.edge.org/conversation/thomas_metzinger-benevolent-artificial-anti-natalism-baan

In German: https://www.nzz.ch/feuilleton/die-mitfuehlende-superintelligenz-die-boeses-schafft-ld.1334142

Anti-Natalism (Def.): The normative thesis that mankind should voluntarily end its own existence.

Scenario A (X exists):
(1) Presence of pain (Bad)
(2) Presence of pleasure (Good)

Scenario B (X never exists):
(3) Absence of pain (Good)
(4) Absence of pleasure (Not bad)

This, bhikkhus, is the noble truth of suffering: birth is suffering, aging is suffering, illness is suffering, death is suffering; union with what is displeasing is suffering; separation from what is pleasing is suffering; not to get what one wants is suffering; in brief, the five aggregates subject to clinging are suffering.

Dhammacakkappavattana Sutta

Value alignment: making sets of value representations consistent.

VAP0 = internal value alignment in machines: giving a formally consistent axiology to AI. → easy

VAP1 = internal value alignment in humans: making ourselves consistent. → difficult

VAP2 = internal value alignment in and between different groups of human agents: social value alignment. → difficult

VAP3 = Stable individual-level human/machine consistency, after solving (VAP0 ∧ VAP1)

→ very difficult, mainly because of VAP1

VAP4 = Stable machine/society consistency, after solving (VAP0 ∧ VAP1 ∧ VAP3)

→ practically unsolvable
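The VAP hierarchy above is a dependency structure: VAP3 presupposes solving (VAP0 ∧ VAP1), and VAP4 presupposes solving (VAP0 ∧ VAP1 ∧ VAP3). A minimal illustrative sketch (not from the slides; the treatment of VAP2 as partially derived from VAP1 is an assumption based on the remark below) models this as a graph and derives an admissible solving order:

```python
# Illustrative sketch only: the VAP dependency structure as a
# directed acyclic graph, with a topological pass that yields one
# order in which the problems could in principle be tackled.
# VAP2's dependence on VAP1 is an assumption, not a slide claim.
PREREQUISITES = {
    "VAP0": set(),                      # machine-internal consistency
    "VAP1": set(),                      # human-internal consistency
    "VAP2": {"VAP1"},                   # social alignment (assumed to build on VAP1)
    "VAP3": {"VAP0", "VAP1"},           # human/machine consistency
    "VAP4": {"VAP0", "VAP1", "VAP3"},   # machine/society consistency
}

def solvable_order(prereqs):
    """Return one admissible order respecting all prerequisites."""
    solved, order = set(), []
    while len(order) < len(prereqs):
        progress = False
        for problem, deps in sorted(prereqs.items()):
            if problem not in solved and deps <= solved:
                solved.add(problem)
                order.append(problem)
                progress = True
        if not progress:
            raise ValueError("cyclic dependency")
    return order

print(solvable_order(PREREQUISITES))
# → ['VAP0', 'VAP1', 'VAP2', 'VAP3', 'VAP4']
```

The sketch only encodes the stated ordering constraints; it says nothing about difficulty, which is the slides' actual point (VAP4 being "practically unsolvable" even once its prerequisites are met).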


Asilomar AI Principle #10: Highly autonomous AI systems should be designed so that their goals and behaviors can be assured to align with human values throughout their operation.

→ The causal roots of many major risk factors really are VAP1 and the (partially derived) VAP2.

→ Resources should be allocated to (VAP1 ∧ VAP2).

→ Resources include AI itself!

What is intelligent risk minimization?

Variation: Scenario II
The empirical premise

5. The superintelligence knows many things about us that we do not fully grasp or understand: it has a deeper understanding of the cognitive biases which evolution has implemented in our cognitive self-model and which hinder us in rational, evidence-based moral cognition.

Empirically, it knows that the phenomenal states of all sentient beings that have emerged on this planet, viewed from an objective, impartial perspective, are far more frequently characterized by subjective qualities of suffering and frustrated preferences than these beings could ever discover themselves, owing to their evolutionarily developed mechanisms of self-deception.

6. It correctly concludes that human beings are unable to act in their own enlightened best interest.

The Bodhisattva-AI

8. The superintelligence concludes that enlightenment is in the best interest of all sentient beings on this planet.

9. Empirically, it knows that naturally evolved biological creatures are unable to realize this fact because of their firmly anchored existence bias.

10. It analyzes the human brain and realizes that a realistic potential for further mental evolution without suffering actually exists. → Premise 5(b) is / can be made false.

11. The superintelligence decides to act benevolently.

The BAT-Scenario

BAT (Def.): “Benevolent Artificial Transhumanism”

The emergence of a purely ethically motivated and genuinely altruistic form of transhumanism in epistemically superior postbiotic systems is conceivable.

Can we control the process?


Basic Copyright Notice & Disclaimer

©2018 This presentation is copyright protected. All rights reserved. You may download or print out a hard copy for your private or internal use. You are not permitted to create any modifications or derivatives of this presentation without the prior written permission of the copyright owner.

This presentation is for information purposes only and contains non-binding indications. Any opinions or views expressed are of the author and do not necessarily represent those of Swiss Re. Swiss Re makes no warranties or representations as to the accuracy, comprehensiveness, timeliness or suitability of this presentation for a particular purpose. Anyone shall at its own risk interpret and employ this presentation without relying on it in isolation. In no event will Swiss Re be liable for any loss or damages of any kind, including any direct, indirect or consequential damages, arising out of or in connection with the use of this presentation.
