🚀 Early Access! Many things may still not work as I refactor the site and make improvements. - Learn more

Marcos Feed

Friday, June 6, 2025

Does anyone have a good system for prioritising publishing drafts?

Published on June 6, 2025 4:58 PM GMTI've been thinking about ways to optimise the EA Forum to increase the likelihood that ideas go from peoples' heads to being published...

0 clicks (0 unique) 1 hour ago

DeepSeek-r1-0528 Did Not Have a Moment

Published on June 6, 2025 3:40 PM GMTWhen r1 was released in January 2025, there was a DeepSeek moment. When r1-0528 was released in May 2025, there was no moment....

0 clicks (0 unique) 2 hours ago

LLMs Blackmail to obtain Pathogen Sequences (And Lie About It)

Published on June 6, 2025 3:03 PM GMTThis post describes an experiment where we investigate AI agents' readiness to use blackmail in order to obtain sensitive pathogen sequences and where...

0 clicks (0 unique) 3 hours ago

Lessons from a year of university AI safety field building

Published on June 6, 2025 2:35 PM GMTThis post is an organizational update from Georgia Tech’s AI Safety Initiative (AISI) and roughly represents our collective view. In this post, we...

0 clicks (0 unique) 4 hours ago

Real-time voice translation

Published on June 6, 2025 7:40 AM GMTObjective Translate Alice's voice for Bob to hear in Bob's language. Translate Bob's voice for Alice to hear in Alice's language. Neither person...

1 clicks (1 unique) 10 hours ago

Liability for Misuse of Models - Dean Ball's Proposal

Published on June 6, 2025 5:34 AM GMTIntroductionThis article explores White House Office of Science and Technology Policy advisor Dean Ball's proposal as detailed in his paper "A Framework for...

1 clicks (1 unique) 13 hours ago

How do AI agents work together when they can’t trust each other?

Published on June 6, 2025 3:10 AM GMTI investigated this question by having Claude play the advanced social deduction game Blood on the Clocktower. Clocktower is a game similar to...

1 clicks (1 unique) 15 hours ago

Large Language Models suffer from Anterograde Amnesia

Published on June 6, 2025 1:30 AM GMTMemento (2000)My wife and I are power users of Large Language Models (LLMs). My go-to LLM has been Google Gemini, while she has...

0 clicks (0 unique) 17 hours ago

Discontinuous Linear Functions?!

Published on June 6, 2025 12:29 AM GMTWe know what linear functions are. A function f is linear iff it satisfies additivity f(x+y)=f(x)+f(y) and homogeneity f(ax)=af(x). We know what continuity...

0 clicks (0 unique) 18 hours ago

Avoiding AI Deception: Lie Detectors can either Induce Honesty or Evasion

Published on June 5, 2025 11:07 PM GMTLarge language models (LLMs) are often fine-tuned after training using methods like reinforcement learning from human feedback (RLHF). In this process, models are...

1 clicks (1 unique) 19 hours ago

Thursday, June 5, 2025

Introducing: Meridian Cambridge's new online lecture series covering frontier AI and AI safety

Published on June 5, 2025 9:55 PM GMTThis is a linkpost for [https://www.meridiancambridge.org/language-models-course]Meridian Cambridge, in partnership with Cambridge University's Center for Data Driven Discovery (C2D3),  has produced a 16-part lecture...

3 clicks (3 unique) 20 hours ago

cheaper sodium electrolysis

Published on June 5, 2025 9:49 PM GMTsodium electrolysis Aluminum metal is a widely-used material. It costs ~$2.5/kg. A significant fraction of its production cost is electricity. Currently, Na metal...

1 clicks (1 unique) 20 hours ago

Histograms are to CDFs as calibration plots are to...

Published on June 5, 2025 8:20 PM GMTAs you know, histograms are decent visualizations for PDFs with lots of samples...10k predictions, 20 bins ...but if there are only a few...

2 clicks (2 unique) 22 hours ago

Levels of Doom: Eutopia, Disempowerment, Extinction

Published on June 5, 2025 7:08 PM GMTDisempowerment is on the fence, gets interpreted as either implying human extinction or being a good place. "Doom" tends to be ambiguous between...

1 clicks (1 unique) 23 hours ago

LLM in-context learning as (approximating) Solomonoff induction

Published on June 5, 2025 5:45 PM GMTEpistemic status: One week empirical project from a theoretical computer scientist. My analysis and presentation were both a little rushed; some information that...

1 clicks (1 unique) 1 day ago

Fundamental Uncertainty: Chapter 2 - How do words get their meaning?

Published on June 5, 2025 4:32 PM GMTN.B. This is a chapter in a book about truth and knowledge. It's a major revision to the version of this chapter in...

1 clicks (1 unique) 1 day ago

Human opportunities in the age of AI

Published on June 5, 2025 3:49 PM GMTMuch of today's discussion around AI centers around current human labor that will be rendered meaningless. While the near-term will assuredly be disruptive,...

2 clicks (2 unique) 1 day ago

AI Might Kill Everyone

Published on June 5, 2025 3:37 PM GMT(Crosspost from my blog). (I’ll be at EAG London this weekend—come say hi. Also, this is my thousandth blogpost—cool milestone!)Several people have wondered...

1 clicks (1 unique) 1 day ago

WWDC25 Wallpaper

Decorate your desktop with these sleek new WWDC-inspired wallpapers.

0 clicks (0 unique) 1 day ago

Powerful Predictions

Published on June 5, 2025 10:44 AM GMTA thoughtful post by Anton Leicht, “Powerless Predictions,” notes that forecasting may be underutilized by AI policy organizations. But how much is forecasting...

2 clicks (2 unique) 1 day ago

Welcome to Postreads

12,072 articles across 53 sources

Follow your favorite sources, create custom feeds, and discover quality content from across the web. Use as your personal front page or private reading space.

Start Your Feed