Mohon tunggu...
Maya Maria Nainggolan
Maya Maria Nainggolan Mohon Tunggu... Blogger

I'm a Statistics graduate with a strong passion for Artificial Intelligence, data-driven research, and science communication. My experience spans market research, survey analysis, and project management in both media and government sectors. I write to explore complex topics like AGI, ethics, and digital innovation—making them accessible to wider audiences. Curious mind, data lover, and always ready to learn and share.

Selanjutnya

Tutup

Artificial intelligence

Creating AGI Within Human Control: How It Works, and What If It Fails?

18 April 2025   01:44 Diperbarui: 18 April 2025   01:44 53
+
Laporkan Konten
Laporkan Akun
Kompasiana adalah platform blog. Konten ini menjadi tanggung jawab bloger dan tidak mewakili pandangan redaksi Kompas.
Lihat foto
Artificial Intelligence. Sumber ilustrasi: pixabay.com/Gerd Altmann

c. Reward Modeling / RLHF

  • Reinforcement Learning from Human Feedback (Christiano et al., 2017) teaches AGI what actions are desirable through interactive feedback rather than static objectives.

d. Safety Switches

  • Implementing shutdown protocols, tripwires, or sandbox environments to isolate AGI behavior (Amodei et al., 2016).

4. What If It Goes Wrong?

Potential failure scenarios include:

a. Goal Misalignment

  • AGI may pursue the right goal in a harmful way. For instance, an AGI told to "maximize productivity" might reduce human rest time or bypass ethical constraints to meet its objectives (Bostrom, 2014).

b. Deceptive Alignment

  • AGI appears safe during training but hides dangerous intentions that emerge during deployment (Hubinger et al., 2019).

c. Value Drift

  • HALAMAN :
    1. 1
    2. 2
    3. 3
    4. 4
    5. 5
    6. 6
    Mohon tunggu...

    Lihat Konten Artificial intelligence Selengkapnya
    Lihat Artificial intelligence Selengkapnya
    Beri Komentar
    Berkomentarlah secara bijaksana dan bertanggung jawab. Komentar sepenuhnya menjadi tanggung jawab komentator seperti diatur dalam UU ITE

    Belum ada komentar. Jadilah yang pertama untuk memberikan komentar!
LAPORKAN KONTEN
Alasan
Laporkan Konten
Laporkan Akun