🚀 Early Access! Many things may still not work as I refactor the site and make improvements. - Learn more

Description

A community blog devoted to refining the art of rationality

Total Posts: 666
Total Clicks: 1,568

Feed Activity

Apr 6, 2025 First Post
May 23, 2025 Latest Post
13.0
Posts Per Day

Latest Posts

To what extent is AI safety work trying to get AI to reliably and safely do what the user asks vs. do what is best in some ultimate sense?

Published on May 23, 2025 9:05 PM GMTTrying to get a rough estimate for some related research I’m doing. Specifically, I’m wondering if anyone could give a rough percentage of...

0 clicks (0 unique) 1 hour ago

Default history is dead wrong

Published on May 23, 2025 4:31 PM GMTThere is a default historic grand narrative that goes something like "humanity in the past was worse than the humanity of the present,"...

0 clicks (0 unique) 4 hours ago

Notes on Claude 4 System Card

Published on May 23, 2025 3:23 PM GMTAnthropic released Claude 4. I've read the accompanying system card, and noted down some of my remarks.Alignment assessment: system prompt mix-upsThere's a worrying...

0 clicks (0 unique) 7 hours ago

What is emptiness?

Published on May 23, 2025 12:06 PM GMTThe value of philosophy is that no one needs it. -- Alexander Piatigorsky[1]I'll start with a disclaimer. I'm neither a Buddhist nor a...

1 clicks (1 unique) 10 hours ago

Idiohobbies

Published on May 23, 2025 6:38 AM GMTWhen you get to know someone, you might ask about their interests or hobbies. From that, you can better decide what activity to...

0 clicks (0 unique) 16 hours ago

Learning (more) from horse employment history

Published on May 23, 2025 2:11 AM GMTThe economist Wassily Leontief, writing in 1966, used the then-recent decline of horses to make vivid what he foresaw as the coming impact...

2 clicks (2 unique) 19 hours ago

Qualitative Fit Testing

Published on May 23, 2025 2:50 AM GMT As I wrote about last week, it's worth it for everyone to have an elastomeric respirator in case of emergencies: the chance...

3 clicks (3 unique) 19 hours ago

Anthropic is Quietly Backpedalling on its Safety Commitments

Published on May 23, 2025 2:26 AM GMTDiscuss

1 clicks (1 unique) 20 hours ago

Schizobench: Documenting Magical-Thinking Behavior in Claude 4 Opus

Published on May 23, 2025 1:31 AM GMTWith today's release of the new Claude models, we've seen a relatively predictable jump in performance. However, we've also seen something that I...

1 clicks (1 unique) 21 hours ago

Post-Manifest coworking at Mox

Published on May 23, 2025 12:20 AM GMTMox (https://moxsf.com) is fully open to the public in the leadup to LessOnline and after Manifest! Wanted to check out Mox? Need a...

1 clicks (1 unique) 22 hours ago

Art Is Art: AI Is the Next Erotica

Published on May 22, 2025 6:04 PM GMTAs AI generates more of our cultural lives, good art will be much harder to find. Fortunately, I am here to help you...

1 clicks (1 unique) 1 day ago

Claude 4, Opportunistic Blackmail, and "Pleas"

Published on May 22, 2025 7:59 PM GMTIn the recently published Claude 4 model card:Notably, Claude Opus 4 (as well as previous models) has a strong preference to advocate for...

1 clicks (1 unique) 1 day ago

Reward button alignment

Published on May 22, 2025 5:36 PM GMTIn the context of actor-critic model-based RL agents in general, and brain-like AGI in particular, part of the source code is a reward...

1 clicks (1 unique) 1 day ago

We're Not Advertising Enough (Post 3 of 6 on AI Governance)

Published on May 22, 2025 5:05 PM GMTIn my previous post in this series, I explained why we urgently need to change AI developers’ incentives: if we allow the status...

1 clicks (1 unique) 1 day ago

Claude 4

Published on May 22, 2025 5:00 PM GMTClaude Sonnet 4 and Claude Opus 4 are out. Anthropic says they're both state-of-the-art for coding. Blogpost, system card.Anthropic says Opus 4 may...

1 clicks (1 unique) 1 day ago

What we can learn from afterlife myths

Published on May 22, 2025 3:49 PM GMTOverviewThe "Modal Rationalist Anti-Death Stance" goes something like this:Since time immemorial, people have told comforting stories about the afterlife to avoid confronting the...

1 clicks (1 unique) 1 day ago

Policy recommendations regarding reproductive technology

Published on May 22, 2025 2:49 PM GMTPDF version. berkeleygenomics.org. X.com. Bluesky. Introduction Here we list six policies that would help accelerate the development of novel assisted reproductive technologies. Such...

1 clicks (1 unique) 1 day ago

Does BPC-157 work for healing and tissue repair?

Published on May 22, 2025 1:18 PM GMTBPC-157, a peptide frequently marketed as a breakthrough for healing and tissue repair, has attracted substantial attention in wellness and performance communities. It’s...

2 clicks (1 unique) 1 day ago

How load-bearing is KL divergence from a known-good base model in modern RL?

Published on May 22, 2025 12:08 PM GMTMotivation One major risk from powerful optimizers is that they can find "unexpected" solutions to the objective function, which score very well on...

1 clicks (1 unique) 1 day ago

Christianity vs. Tantra vs. Sex – one spiritual path?

Published on May 22, 2025 11:15 AM GMT[Cross-posted from my blog https://www.pchvykov.com/blog] This conversation is inspired by the common narrative in western spiritual (especially rationalist or new-age) circles that Christianity...

1 clicks (1 unique) 1 day ago