models aren't nearly good enough still we need way way more progress - Every model should be able to implement any paper into torchtitan without issue in less than 20 minutes before I'll start feeling pretty good lol
Loading thread detail
Fetching the original tweets from X for a clean reading view.
Hang tight—this usually only takes a few seconds.