bookmark · Added on December 29, 2025
· 1 min read
An online resource for homotopy-coherent mathematics
bookmark · Added on December 29, 2025
· 2h 54m read
video · Added on December 25, 2025
· 51:58 · 38.4K views
The inaugural Natural Philosophy Symposium was held in Baltimore on May 29-31, 2025. It was sponsored by the Natural Philosophy Forum at Johns Hopkins (https://...
video · Added on December 25, 2025
· 54:04 · 16.0K views
This presentation offers a heuristic proof (and simulations of a primordial soup) suggesting that life—or biological self-organization—is an inevitable and emer...
video · Added on December 25, 2025
· 1:17:34 · 9.4K views
Taken from: Logic for CS, Shai Ben-David, U Waterloo Fall 2015
https://www.youtube.com/channel/UCg9V0y9_RxG7hg5GjcyS2OA
Uploader: Adrian Apostol
Duration: 4654s...
video · Added on December 25, 2025
· 8:15 · 3.1K views
20-ish Questions shows a different side of Wyss Institute faculty, touching on aspects of their personal life, hobbies, interests, as well as their research. Th...
video · Added on December 25, 2025
· 1:30:21 · 10.1K views
ACCU Membership: https://tinyurl.com/ydnfkcyn
---
The Past, Present and Future of Programming Languages - Kevlin Henney - ACCU 2025
---
Programming languages ...
video · Added on December 25, 2025
· 17:30 · 115.5K views
An introduction to the Twin Prime Conjecture and sieve methods, one of the most beautiful branches of modern number theory.
ONLINE COURSE:
I'm teaching an onl...
video · Added on December 25, 2025
· 16:45 · 30.6K views
What does a mathematical theory consist of? This question was at the heart of the foundational dispute between the members of Bourbaki and practitioners of cate...
video · Added on December 25, 2025
· 57:41 · 7.8K views
This is a ~57 min talk titled "Whitehead on the Ingression of Novel Form: Toward a New Formal Causality in the Life Sciences" by Matt Segall (footnotes2plato.co...
video · Added on December 25, 2025
· 1:21:44 · 22.1K views
Prof. Richard Borcherds received a Fields medal in 1998. He is most famous for proving Monstrous Moonshine, a conjecture of John Conway and Simon Norton relatin...
video · Added on December 25, 2025
· 55:06 · 12.7K views
A pioneer in developmental and synthetic biology, Levin’s lab explores how bioelectric signaling among non-neural cells implements patterning, regeneration, and...
video · Added on December 25, 2025
· 1:08:52 · 4.6K views
This is a ~1 hour 8 minute talk titled "The ontogenetic alternative: “Platonism”, khôric mater(ial)ism, and open-ended evolution" by Timothy Jackson (https://sc...
video · Added on December 25, 2025
· 1:28:33 · 1.7K views
Interviewed by Dag Spicer on 2025-08-20 in Mountain View, CA
© Computer History Museum
This interview is with three principal designers and architects of the R...
video · Added on December 25, 2025
· 52:28 · 8.2K views
Millennium Prize Problems Lecture 11/12/2025
Speaker: Pierre Deligne, Institute for Advanced Study
Title: What is the Hodge conjecture?
Abstract: The Hodge c...
video · Added on December 25, 2025
· 56:17 · 14.8K views
first AGITTOC pseudolecture June 27, 2020 (see math216.wordpress.com for more on the AGITTOC experiment)
Also a first stab at playing with this technology.
Upl...
video · Added on December 25, 2025
· 43:58 · 1.2M views
Grigori Perelman proved the Poincare conjecture and then refused a million dollar prize (the Millennium Prize). He is the only mathematician who has declined th...
video · Added on December 25, 2025
· 54:30 · 3.5K views
William Zhang - TinyTPU
Uploader: GPU MODE
Duration: 3270s
Views: 3502
video · Added on December 25, 2025
· 55:44 · 5.1K views
All available speaker abstracts and slides can be found on our webpage - https://www.chapman.edu/scst/conferences-and-events/grothendieck-conference.aspx
Upload...
video · Added on December 25, 2025
· 43:22 · 3.8K views
My p-adic hat: https://www.bonfire.com/p-adic-hat-1/
My Patreon: https://patreon.com/K_Theory?utm_medium=unknown&utm_source=join_link&utm_campaign=creatorshare...
bookmark · Added on December 25, 2025
· 29 min read
bookmark · Added on December 25, 2025
· 16 min read
This is basically an expanded explanation of what I did after reading this tutorial by Marwan Burelle and then sitting down and trying to write my own implementation, so the steps are going to be fairly similar.
bookmark · Added on December 22, 2025
· 4 min read
We introduce NitroGen, a vision-action foundation model for generalist gaming agents that is trained on 40,000 hours of gameplay videos across more than 1,000 games.
bookmark · Added on December 22, 2025
· 10 min read
Many people have been misled by LLMs into believing they have an important breakthrough when they don't. If you think you have a breakthrough, please try the reality checks in this post (the first is fast and easy). If you're wrong, now is the best time to figure that out!
bookmark · Added on December 22, 2025
· 5 min read
A new model predicts, minute by minute, how individual cells will fold, divide, and rearrange during a fruit fly’s earliest stage of growth. The method may help scientists predict the development of more complex tissues or identify early signs of diseases such as asthma and cancer.
bookmark · Added on December 22, 2025
· 16 min read
MultiCell is a deep learning method to capture complex cell dynamics during multicellular development.
bookmark · Added on December 19, 2025
· 58 min read
From GPU architecture and PTX/SASS to warp-tiling and deep asynchronous tensor core pipelines.
bookmark · Added on December 19, 2025
· 1h 0m read
An open-source collection of core C++ library code
bookmark · Added on December 19, 2025
· 19 min read
We often think of optimization with momentum as a ball rolling down a hill. This isn't wrong, but there is much more to the story.
bookmark · Added on December 13, 2025
· 52 min read
An axiomatic theory of truth is a deductive theory of truth as a primitive undefined predicate.
bookmark · Added on December 13, 2025
· 3 min read
I am having a lot of trouble with the concept of Tarski's undefinability theorem as it relates to set theory.
Tarski's undefinability theorem says that there is no formula $Tr$ on the natural numbers
bookmark · Added on December 13, 2025
· 12 min read
Since the release of my preprint with Tim, Ben, and Freddie proving the Polynomial Freiman-Ruzsa (PFR) conjecture over $latex {\mathbb F}_2$, I (together with Yael Dillies and Bhavik Mehta) have st…
bookmark · Added on December 13, 2025
· 1h 36m read
bookmark · Added on December 13, 2025
· 6 min read
Contribute to JOSHCLUNE/LeanHammer development by creating an account on GitHub.
bookmark · Added on December 13, 2025
· 2 min read
Below is the list of BOLT project ideas with a brief description of each. Once a project is picked for active development, expect to start a new topic and file an RFC when suitable. Comment on this thread to add new ideas to the list. CFG Disassembler. BOLT symbolizes disassembly output and reconstructs control flow for detected functions, including that for indirect branches corresponding to jump tables. Such functionality by itself is useful for analyzing binary code. BOLT outputs the cont...
bookmark · Added on December 13, 2025
· 1 min read
This repository contains various json files indexing sorries in public Lean 4 repositories. They are generated using the crawler developed in the SorryDB project.
bookmark · Added on December 13, 2025
· 15 min read
The complete guide to understanding the concept of speculative decoding in LLM inference and implementing it from scratch
bookmark · Added on December 13, 2025
· 1 min read
I found these papers/links useful when writing the PP and GP compiler backends, so I'm saving them here. I'll try to list them front->back t...
bookmark · Added on December 13, 2025
· 10 min read
Language models memorize substantial parts of their training data. For example, prompting Llama 3.
bookmark · Added on December 13, 2025
· 1 min read
Source code for the "Introduction to embedded Rust" talk - swallez/intro-embdedded-rust
bookmark · Added on December 10, 2025
· 3h 18m read
Synthetic biology and bioengineering provide the opportunity to create novel embodied cognitive systems (otherwise known as minds) in a very wide variety of ...
bookmark · Added on December 9, 2025
· 8h 17m read
bookmark · Added on December 9, 2025
· 7 min read
Refresh and Reload: Secrets of the Unified Buffer
bookmark · Added on December 8, 2025
· 12 min read
This is the lightning(-ish) talk I gave at TigerBeetle 1000x World Tour Belgrade,
transcribed to article form. Huge thanks to Ludwig
for hosting the event and the TigerBeetle folks
for setting it up.
The slides are available in PDF format as well.
bookmark · Added on December 8, 2025
· 29h 15m read
bookmark · Added on December 8, 2025
· 5 min read
WikiChip is the preeminent resource for computer architectures and semiconductor logic engineering, covering historical and contemporary electronic systems, technologies, and related topics.
bookmark · Added on December 2, 2025
· 7 min read
Type checkers don't have to be crazy complicated.
bookmark · Added on December 2, 2025
· 21 min read
A deep dive into prompt caching - practical tips to improve cache hits and how vLLM's paged attention enables KV-cache reuse across requests via automatic prefix-caching
bookmark · Added on December 2, 2025
· 15 min read
Is the traditional 2D imaging model nearing the end of its usefulness, or does it have a shiny future in the “modern graphics” world? I spent a week on a research retreat in a cottage in the woods to answer this question, as it shapes the future of UI toolkits. Performant UI must use GPU effectively, and it’s increasingly common to write UI directly in terms of GPU rendering, without a 2D graphics API as in the intermediate layer. Is that the future, or perhaps a mistake?
bookmark · Added on December 2, 2025
· 4 min read
Sensory coding 2: electric boogaloo
bookmark · Added on November 24, 2025
· 45 min read
The personal website of Elias Daler about programming, technology and videogames
bookmark · Added on November 17, 2025
· 3 min read
We demonstrate an open-source bitwise consistent on-policy RL run with TorchTitan as the training engine and vLLM as the inference engine. Built on top of vLLM’s recent work on batch-invariant inference, we show how to run an RL fine-tune of Qwen3 1.7B with bitwise matching training and inference numerics in our open-sourced instructions:
bookmark · Added on November 17, 2025
· 1 min read
This six-session course will introduce participants to thinking about physical interaction as communication, and hence thinking about physical systems as communicating agents.
bookmark · Added on November 17, 2025
· 2 min read
Textbook on Theoretical Computer Science by Boaz Barak
bookmark · Added on November 17, 2025
· 37 min read
bookmark · Added on November 9, 2025
· 36 min read
I love the feeling of having a new way to think about the world. I especially love when there’s some vague idea that gets formalized into a concrete concept.
bookmark · Added on November 7, 2025
· 12 min read
Fil-C ensures memory safety of all operations in the C and C++ language. The hardest part of C memory safety is pointer safety.
bookmark · Added on November 7, 2025
· 8 min read
Fil-C uses a parallel concurrent on-the-fly grey-stack Dijkstra accurate non-moving garbage collector called FUGC (Fil's Unbelievable Garbage Collector).
bookmark · Added on November 6, 2025
· 14 min read
Explains Transparent Hugepages in a nutshell, techniques that can be used to measure the performance impact, shows the effect on a real-world application.
bookmark · Added on November 6, 2025
· 41 min read
Platonism about mathematics (or mathematical platonism) is the metaphysical view that there are abstract mathematical objects whose existence is independent of us and our language, thought, and practices.
bookmark · Added on November 6, 2025
· 8 min read
In an OLTP database processing 1 million transactions per second, every microsecond matters. Each transaction allocates dozens of temporary objects: query ...
bookmark · Added on November 6, 2025
· 2 min read
Proof theory is not an esoteric technical subject that was invented to support a formalist doctrine in the philosophy of mathematics; rather, it has been developed as an attempt to analyze aspects of mathematical experience and to isolate, possibly overcome, methodological problems in the foundations of mathematics.
video · Added on November 5, 2025
· 55:03 · 11.2K views
How do you type-check 1.8 million lines of Python per second? Neil Mitchell explains how Pyrefly (a new Python type checker) achieves this level of performance....
video · Added on November 5, 2025
· 1:01:28 · 2.5K views
This is an invited talk in BAMΞ's Mathematical Phenomenology Sprint.
Cf. https://bamxi.org/research-activities/mathematical-phenomenology-sprint/
Organizing In...
video · Added on November 5, 2025
· 1:28:15 · 3.1K views
This is a ~1 hour 30 minute talk + Q&A by Chris Fields (https://allencenter.tufts.edu/christopher-a-chris-fields-ph-d/) titled "From Experience to Math", given ...
video · Added on November 5, 2025
· 51:17 · 3.2K views
This is a ~50-minute talk titled "Substrate-dependent mathematics hypothesis" by Olaf Witkowski (https://olafwitkowski.com/), presented for our Platonic Space s...
bookmark · Added on November 4, 2025
· 1 min read
I am happy to announce that a draft of my upcoming book “Control structures in programming languages: from goto to algebraic effects” is now available at https://xavierleroy.org/control-structures . The book compares several programming languages from the standpoint of control structures. OCaml is used intensively to discuss control in functional programming, including continuation-passing style, control operators, exceptions, user-defined effects and effect handlers, with many examples that I...
bookmark · Added on October 30, 2025
· 1 min read
video · Added on October 28, 2025
· 1:01:03 · 7.2K views
Analysis and Mathematical Physics
2:30pm|Simonyi Hall 101 and Remote Access
Topic: Towards a Geometric Theory of Deep Learning
Speaker: Govind Menon
Affiliation...
bookmark · Added on October 27, 2025
· 7 min read
Theory of Diversity (RL) - Powered by Obsidian Publish.
bookmark · Added on October 27, 2025
· 1 min read
Libraries have been trying to collect humanity’s knowledge almost since the invention of writing. In the digital age, it might actually be possible to create a comprehensive collection of all human writing meeting certain criteria That’s what shadow libraries do - collect and share as many books as
bookmark · Added on October 26, 2025
· 36 min read
a loss plateau that looked like my mistake turned out to be a PyTorch bug. tracking it down meant peeling back every layer of abstraction, from optimizer internals to GPU kernels.
bookmark · Added on October 26, 2025
· 25 min read
Blogpost
bookmark · Added on October 15, 2025
· 6 min read
My name is Marc Brooker. I've been writing code, reading code, and living vicariously through computers for as long as I can remember.
bookmark · Added on October 13, 2025
· 37 min read
Pushing single-GPU inference throughput to the edge without libraries
video · Added on October 6, 2025
· · views
During the offensive on Pervomaiske aka “Pervo”, the Chosen Company sustained massive casualties making the Russians believe that they had obliterated the entir...
video · Added on October 6, 2025
· 49:33 · 412.0K views
Mecha BREAK is finally here! https://t.mechabreak.com/c/yfzzso Thank you to Amazing Season Games for Sponsoring this video
►Follow my Twitch! https://www.twitc...
video · Added on October 6, 2025
· · views
Become a member and give gum shots: https://www.youtube.com/channel/UCLtnLkDf4nsXIIcr8orHUHA/join
According to Lex Fridman, he's the most brilliant man on the...
video · Added on October 6, 2025
· 50:18 · 8.5K views
As #rust gets more and more production usage, many of the examples people talk about are fairly high level: things like web applications. While that’s great, is...
video · Added on October 6, 2025
· 42:12 · 2.9K views
The CERN Large Hadron Collider (LHC) generates an unprecedented O(10,000) exabytes of raw data annually from high-energy proton collisions. Managing this vast d...
video · Added on October 6, 2025
· 52:40 · 94.9K views
Teaching Tom Crawford a bit about my favorite subject -- Lie algebras.
Check out Part 2: https://www.youtube.com/watch?v=ap7GZKCcgS8
🌟Support the channel🌟
Pat...
video · Added on October 6, 2025
· 29:07 · 39.1K views
What are the nature of physical reality, and the nature of consciousness? These two questions have remained unanswered since the beginning of the human race. Sc...
video · Added on October 6, 2025
· 37:20 · 1.1M views
Diffusion models, CLIP, and the math of turning text into images
Welch Labs Book: https://www.welchlabs.com/resources/imaginary-numbers-book
Sections
0:00 - In...
video · Added on October 6, 2025
· 1:16:17 · 3.3K views
To try this awesome whiteboard:
📌 [Free whiteboard] https://tldraw.com/?utm_source=youtube&utm_medium=socials&utm_campaign=standard&utm_term=yacinemahdid
📌 [SD...
video · Added on October 6, 2025
· 1:52:46 · 40.6K views
Sam H. Smith's talk at BSC 2025 about implementing AST-free compilers and optimizing with sea of nodes.
Sam's links:
- https://x.com/SamHSmith2
- https://samhs...
video · Added on October 6, 2025
· 37:58 · 208.4K views
Code/Writeup/Resources: https://github.com/cnlohr/lolra
LoLRa Merch: https://cnlohr-shop.fourthwall.com/
Patreon: https://patreon.com/cnlohr
Memes, in order of...
video · Added on October 6, 2025
· 46:07 · 1.4M views
Try SendCutSend 15% off for your next project! https://sendcutsend.com/breakingtaps/
My descent into madness, chasing one micrometer.
Watch this ad-free on Ne...
video · Added on October 6, 2025
· 41:31 · 7.2K views
Compose NYC 2019
Speaker: David Christiansen
When implementing a type checker, one must answer two questions: how to compare types for sameness, be it a subsum...
video · Added on October 6, 2025
· 1:25:16 · 2.6K views
https://www.cppnow.org
---
Beyond Sequential Consistency - Leveraging Atomics for Fun and Profit - Christopher Fretz - C++Now 2025
---
In 2011, C++ introduce...
video · Added on October 6, 2025
· 46:01 · 7.1K views
A Celebration of Mathematics and Computer Science
Celebrating Avi Wigderson's 60th Birthday
October 5 - 8, 2016
More videos on http://video.ias.edu
Uploader: I...
video · Added on October 6, 2025
· 19:04 · 21.6K views
Welcome to the 2nd Episode of our Tennis Elbow 4 New Player Guide mini-series!
In this video, I will show you HOW TO PLAY Tennis Elbow 4. We will dive deep in...
video · Added on October 6, 2025
· 23:41 · 134.3K views
https://Patreon.com/ThinkingFootball
Our beats can be found here! ⬇️⬇️⬇️
https://twtw.bandcamp.com/
#nfl #cfb #interception
Uploader: Thinking Football
Durat...
video · Added on October 6, 2025
· 46:45 · 9.1K views
This is a ~46 minute talk given at a conference on consciousness, presenting my lab's data on cognition in body organs outside the brain and then some speculati...
video · Added on October 6, 2025
· 57:24 · 47.5K views
Avi Wigderson is a professor of Mathematics at the Institute for Advanced Study in Princeton. After studying Computer Science at Technion in Haifa, he obtained ...
video · Added on October 6, 2025
· 56:12 · 3.1K views
This is a ~ 1 hour and 10 minute talk plus ~30 minutes Q&A discussion with the computational group in our Center, titled "A Theoretical Computer Science Lens on...
video · Added on October 6, 2025
· 19:07 · 34.0K views
Coaching form: https://forms.gle/N2ue7X8YzoTRtLJM7
Where To Follow Me!
My Twitch: https://www.twitch.tv/piggyxdd
My Twitter: https://twitter.com/piggyxdd
Pig...
video · Added on October 6, 2025
· 17:01 · 47.1K views
Links:
- Patreon (Support the channel directly!): https://www.patreon.com/Asianometry
- X: https://twitter.com/asianometry
- Newsletter & Podcast (available thr...
bookmark · Added on October 2, 2025
· 16 min read
In this essay, I provide some advice to up-and-coming researchers in machine learning (ML), based on my experience doing research and advising others.
bookmark · Added on September 29, 2025
· 3 min read
Posts and writings by Julian Schrittwieser
bookmark · Added on September 25, 2025
· 1 min read
Dr. Evil learns that a duplicate of Dr.
bookmark · Added on September 17, 2025
· 1h 32m read
How did life on Earth begin? In the nineteenth century, this seemed like an unanswerable question.
bookmark · Added on September 15, 2025
· 1h 1m read
Stephen Wolfram explains the rich ruliology of lambdas, made particularly significant by their connection to practical computing. Covers basic computations to undecidability to multiway graphs and evaluation strategies.
bookmark · Added on September 8, 2025
· 14 min read
I forgot to cancel my Midjourney v7 subscription last month. I love Midjourney, amazing model and great product. I have been short on ideas and, honestly, co...
bookmark · Added on September 8, 2025
· 17 min read
Fang-Pen Lin's blog about programming
bookmark · Added on September 2, 2025
· 1 min read
bookmark · Added on September 1, 2025
· 16 min read
Plus some lore
bookmark · Added on September 1, 2025
· 35 min read
From paged attention, continuous batching, prefix caching, specdec, etc. to multi-GPU, multi-node dynamic serving at scale.
bookmark · Added on September 1, 2025
· 15 min read
When training large scale LLMs, there is a large assortment of parallelization strategies which you can employ to scale your training runs to work on more GPUs.
bookmark · Added on August 29, 2025
· 1 min read
Tutorial for learning about solving partially observable Markov decision processes (POMDPs).
bookmark · Added on August 26, 2025
· 9 min read
We discovered why language models catastrophically fail on long conversations: when old tokens are removed to save memory, models produce complete gibberish. We found models dump massive attention onto the first few tokens as "attention sinks"—places to park unused attention since softmax requires weights to sum to 1. Our solution, StreamingLLM, simply keeps these first 4 tokens permanently while sliding the window for everything else, enabling stable processing of 4 million+ tokens instead of just thousands. This mechanism is now in HuggingFace, NVIDIA TensorRT-LLM, and OpenAI's latest models.
bookmark · Added on August 26, 2025
· 9 min read
How can a language model comprehend a million-token document without drowning in O(N²) attention cost? A statistical model revealing the success of block sparse attention through learned similarity gaps.
bookmark · Added on August 19, 2025
· 1h 15m read
Gödel’s two incompleteness theorems are among the most important results in modern logic, and have deep implications for various issues.
bookmark · Added on August 19, 2025
· 5 min read
We’ve all been there before: by the time you start graduate school in Princeton, you’ve already invented the Turing machine, pioneered the concept of computational universality, and pro…
bookmark · Added on August 19, 2025
· 18 min read
On Thursday, I probably should've told you explicitly that I was compressing a whole math course into one lecture. On the one hand, that means I don't really expect you to have understood everything.
bookmark · Added on August 19, 2025
· 18 min read
A global pandemic, apocalyptic fires, and the possible descent of the US into violent anarchy three days from now can do strange things to the soul. Bertrand Russell—and if he’d done no…
bookmark · Added on August 19, 2025
· 5 min read
Note: I don't think this idea is original, but I couldn't find a good post going over the implications. …
bookmark · Added on August 19, 2025
· 1h 22m read
(Originally posted in December 2015: A dialogue between Ashley, a computer scientist who's never heard of Solomonoff's theory of inductive inference,…
bookmark · Added on August 19, 2025
· 31 min read
> It can scarcely be denied that the supreme goal of all theory is to make the irreducible basic elements as simple and as few as possible without having to surrender the adequate representation of a…
bookmark · Added on August 19, 2025
· 46 min read
We love TPUs at Google, but GPUs are great too. This chapter takes a deep dive into the world of NVIDIA GPUs – how each chip works, how they’re networked together, and what that means for LLMs, especially compared to TPUs. This section builds on
Chapter 2 and
Chapter 5, so you are encouraged to read them first.
bookmark · Added on August 19, 2025
· 24 min read
Usually, people start these ‘critiques’ with a disclaimer that they are not trying to trash the framework, and talk about how it’s a tradeoff.
bookmark · Added on August 19, 2025
· 4 min read
Taste has become a bit of a buzz-word, at least among VC-types. Taste is a philosophy, it must run deep in the core of your business, they say.
bookmark · Added on August 19, 2025
· 7 min read
Personal Blog
bookmark · Added on August 19, 2025
· 19 min read
Personal Blog
bookmark · Added on August 19, 2025
· 6 min read
Personal blog of Stephen Diehl - Software engineer writing about technology, programming, and the future
bookmark · Added on August 19, 2025
· 7 min read
The glorious Glasgow Haskell Compilation system, since around version 6. 10 has had support for indexed type familes, which let us represent functional relationships between types.
bookmark · Added on August 18, 2025
· 5 min read
An attempt to understand and build a TPU—by complete novices.
bookmark · Added on August 18, 2025
· 2 min read
Hunyuan-GameCraft
bookmark · Added on August 18, 2025
· 1h 25m read
Another explainer on a fun, esoteric topic: optimizing code with SIMD (single instruction multiple data, also sometimes called vectorization).
bookmark · Added on August 12, 2025
· 11 min read
We all know how deep learning works.
bookmark · Added on August 12, 2025
· 14 min read
We recently proposed Muon: a new neural net optimizer.
bookmark · Added on August 11, 2025
· 6 min read
[This work was performed as my final project for ARENA 5.0.] …
bookmark · Added on August 11, 2025
· 37 min read
This is the completed article that Luke wrote the first half of. My thanks go to the following for reading, editing, and commenting; Luke Muehlhauser…
bookmark · Added on August 5, 2025
· 6 min read
SemiconSam Original Report
bookmark · Added on August 5, 2025
· 23 min read
Hilbert's problems are 23 problems in mathematics published by German mathematician David Hilbert in 1900.
bookmark · Added on August 5, 2025
· 1 min read
bookmark · Added on August 5, 2025
· 45 min read
bookmark · Added on August 4, 2025
· 1h 48m read
Ludwig Wittgenstein’s Philosophy of Mathematics is undoubtedly the most unknown and under-appreciated part of his philosophical opus.
bookmark · Added on July 30, 2025
· 19 min read
Jujutsu (jj) is a version control system with a significantly simplified mental model and command-line interface compared to Git, without sacrificing expressibility or power (in fact, you could argue Jujutsu is more powerful).
bookmark · Added on July 30, 2025
· 1 min read
Cricket is a lazy gradually-typed functional language with objects. It's very tiny but very expressive; anyone can implement it themselves!
bookmark · Added on July 30, 2025
· 1 min read
This post introduces the basic ideas behind dependent-type-based proof assistants, and expressing logic with types and values.
bookmark · Added on July 30, 2025
· 1 min read
This post briefly maps out many different subfields of programming language theory, in an effort to make it more accessible to those outside academia.
bookmark · Added on July 30, 2025
· 1 min read
Category theory is a beautiful and powerful field but it can feel impenetrable without the right entry point. This post hopes to serve as a sort of beginner's guide and reference.
bookmark · Added on July 30, 2025
· 1 min read
Sequent Calculus is a way of doing logic that's very explicit and mechanical. It's used as an important system and notation for type theory and logic related to programming languages.
bookmark · Added on July 29, 2025
· 1 min read
the quick and dirty of Lean
bookmark · Added on July 26, 2025
· 1 min read
Over at Cosmic Variance, I learned that FQXi (the organization that paid for me to go to Iceland) sponsored an essay contest on “The Nature of Time”, and the submission deadline was las…
bookmark · Added on July 25, 2025
· 1 min read
This is a series of lectures aimed at graduate students on the modern design of full-spectrum dependent type theories, such as the core calculi of proof assistants like Agda, Coq, and Lean.
video · Added on July 24, 2025
· · views
Deep technical walkthrough of the “3D Gaussian Splatting” paper, explaining the algorithm, rendering pipeline, and codebase for real-time NeRF-style scene recon...
video · Added on July 24, 2025
· 30:59 · 49.4K views
Fireside keynote with Linus Torvalds covering the evolution of the Linux kernel, open-source development practices, and future directions of operating-system en...
video · Added on July 24, 2025
· · views
Stream shows iterative development of a custom video editor, implementing multi-clip support and interactive GUI controls for clip bounds, demonstrating real-wo...
video · Added on July 24, 2025
· · views
Conversation detailing why ThePrimeagen runs a dedicated Linux workstation, discussing kernel-level efficiency, tooling, and developer workflow benefits compare...
video · Added on July 24, 2025
· 8:36 · 105.6K views
Jonathan Blow critiques typical computer-science curricula, proposing alternative course structures and teaching approaches aimed at producing stronger practica...
video · Added on July 24, 2025
· · views
In-depth explanation of Zig’s vector types and how they compile down to SIMD instructions, illustrating low-level data layout, intrinsic operations, and perform...
video · Added on July 24, 2025
· 2:13:39 · 26.6K views
Live-coding session in C that modifies and debugs Raylib to extend the Musializer project, covering real-time rendering techniques, event handling, and practica...
video · Added on July 24, 2025
· 3:14:02 · 59.0K views
Walk-through of implementing a classic ray-casting engine in TypeScript, rendering a 3D maze in the browser while explaining projection math and performance con...
video · Added on July 24, 2025
· · views
Silent “ASMR” coding demo that builds a minimal client/server socket program in Zig, covering socket creation, binding, listening, and message exchange.
video · Added on July 24, 2025
· 1:31:00 · 8.6K views
Detailed lecture on foundational CUDA performance techniques—memory coalescing, occupancy, and kernel launch parameters—illustrated through hands-on code profil...
video · Added on July 24, 2025
· 1:53:38 · 3.2K views
Live-coded session that implements a bump allocator from scratch, demonstrates eliminating dynamic malloc calls during module initialization, and refactors depe...
video · Added on July 24, 2025
· · views
Explores the geometric and trigonometric principles behind aiming and banking shots in pool, showing how players use a “diamond system” to compute angles and sp...
video · Added on July 24, 2025
· 2:10:45 · 2.2K views
Live-coding session that compiles a Zig simulation to WebAssembly, deploys it server-side and hooks it to a browser client, demonstrating build flags, WASM bind...
video · Added on July 24, 2025
· 9:57 · 9.6K views
Step-by-step tutorial showing how to write and use foreign-function interfaces in Gleam to call Erlang and JavaScript code, including syntax, default impls, and...
video · Added on July 24, 2025
· · views
Uses Minecraft’s Redstone to illustrate how real computers are built, explaining logic gates, registers, ALUs, memory and clocking at the hardware / low-level c...
video · Added on July 24, 2025
· · views
Walkthrough solution of a Japanese Math Olympiad problem requiring creative high-school level mathematics to determine an unknown value X.
video · Added on July 24, 2025
· · views
In-depth exploration of Linux signal handling internals, tracing the kernel path from interrupt to sigreturn, showing register preservation, stack manipulation,...
video · Added on July 24, 2025
· · views
Practical first-look tutorial on installing, configuring, and effectively using the Yabai tiling window manager on macOS to improve keyboard-driven window manag...
video · Added on July 24, 2025
· · views
Live-coding session showing the step-by-step design and implementation of a retro ray-casting 3-D engine in C, covering rendering math, engine architecture, and...
video · Added on July 24, 2025
· 18:15 · 84.5K views
Step-by-step lesson introducing ARM assembly programming—registers, MOV instruction, SWI syscall, compiling, and emulation—providing foundational skills for low...
video · Added on July 24, 2025
· 22:54 · 238.9K views
Jonathan Blow critiques modern software practices and argues for simpler, more efficient programming models, discussing language and tooling design principles t...
video · Added on July 24, 2025
· 43:19 · 75.7K views
Andrew Kelley outlines the 2023 roadmap for the Zig programming language, detailing planned language features, compiler back-end work, tooling, and data-oriente...
video · Added on July 24, 2025
· · views
Rapid-fire tutorial summarizing 100+ foundational Linux concepts—kernel vs GNU, essential commands, distro choices—aimed at beginners setting up and using Linux...
video · Added on July 24, 2025
· · views
Project showcase guiding viewers through designing and breadboarding a custom CPU (GIBCPU), covering instruction set planning, control logic, ALU implementation...
video · Added on July 24, 2025
· · views
Conference talk modifying Doom’s source to use intentionally incorrect π and trigonometric constants, demonstrating how breaking fundamental maths produces non-...
video · Added on July 24, 2025
· 1:02:03 · 1.7M views
Deep dive into the 8-bit game Elite, showing how procedural generation, split-screen 3D, back-face culling and tight 6502 assembly fit a full universe into 22 K...
video · Added on July 24, 2025
· 20:04 · 489.3K views
Historical and technical tour of virtual memory—paging, MMUs and TLBs—from the Atlas supercomputer to modern x86/ARM/RISC-V, detailing how hardware and OS coope...
video · Added on July 24, 2025
· · views
Walk-through of the SQLite 2.x B-tree page-balancing routine, recreated in Rust; visually explains node splitting, merging and re-linking, giving implementation...
video · Added on July 24, 2025
· 32:35 · 22.4K views
Demonstrates applying constraint satisfaction algorithms to procedural content generation, detailing algorithmic concepts and code examples for maps, plants, an...
video · Added on July 24, 2025
· 8:31 · 17.2K views
Technical deep dive into DragonflyDB’s architecture, concurrency model, memory layout, and trade-offs that enable >6 M ops/sec in a Redis-compatible distributed...
video · Added on July 24, 2025
· 5:02:09 · 723.6K views
Live coding session where George Hotz designs and trains a simple neural-network chess engine, examining model architecture, training loop, and gameplay integra...
video · Added on July 24, 2025
· 1:12:33 · 229.7K views
Explores how undefined behavior in C enables aggressive compiler optimizations, illustrating subtle performance-related bugs and best practices for writing safe...
video · Added on July 24, 2025
· · views
Step-by-step retrospective on building a full COOL compiler in C—including lexer, parser, type checker, IR, and x86 assembly backend—highlighting practical impl...
video · Added on July 24, 2025
· 19:48 · 231.3K views
Step-by-step project that assembles fundamental digital components into a functioning minimalist CPU, explaining instruction decoding, control signals and integ...
video · Added on July 24, 2025
· 40:41 · 2.5K views
Conference talk distilling principles and practices for designing, operating and evolving large-scale, failure-tolerant software systems, with emphasis on compl...
video · Added on July 24, 2025
· 2:37:51 · 1.4K views
Live coding session that builds a WebAssembly software renderer from scratch, adding stb_image loading, alpha blending and rotation logic to texture-map a recta...
video · Added on July 24, 2025
· 18:08 · 8.9K views
Walk-through comparing array-oriented languages APL, BQN and Uiua by re-implementing and refactoring the standard-deviation algorithm, highlighting language sem...
video · Added on July 24, 2025
· · views
Vlog-style chronicle of building a custom OpenGL-based game engine over four months, sharing source code, rendering techniques, and learning resources, effectiv...
video · Added on July 24, 2025
· 34:59 · 16.7K views
Practical talk on parallel programming that outlines why naïve multithreading can degrade performance and details debugging techniques, cache effects, and CPU u...
video · Added on July 24, 2025
· · views
Deep dive into two interacting Zig language features—Result Location Semantics and Parameter Reference Optimization—exploring their compiler-level trade-offs, p...
video · Added on July 24, 2025
· 30:12 · 102.2K views
Casey Muratori analyzes smart-pointers, RAII, Rust’s borrow checker, and zero-initialization, advocating alternative memory-handling and error-handling patterns...
video · Added on July 24, 2025
· 39:58 · 43.0K views
Fireside-style discussion with a former Stripe applied-AI engineer about the technical evolution of GPT-3/4 and other generative language models, and the broade...
video · Added on July 24, 2025
· 1:23:07 · 22.1K views
Simon Peyton Jones discusses small-core language design in Haskell and the new Verse language, delving into functional-logic programming concepts and their educ...
video · Added on July 24, 2025
· 13:50 · 12.2K views
Jonathan Blow outlines the design philosophy, type system, and compilation goals of his Jai programming language, offering insights into modern language design ...
video · Added on July 24, 2025
· · views
Step-by-step tutorial in Zig that builds a Trie, handles manual memory allocation/freeing, and benchmarks lookup performance, demonstrating practical data-struc...
video · Added on July 24, 2025
· 2:01:08 · 22.4K views
Jonathan Blow’s talk “The Gauntlet” critiques contemporary software practices and outlines design principles and language-level features (drawn from his Jai wor...
video · Added on July 24, 2025
· 1:02:45 · 9.5K views
Walk-through of refactoring LearnOpenGL’s font-rendering sample—profiling GPU/CPU bottlenecks, redesigning glyph batching, and achieving a 10× frame-rate improv...
video · Added on July 24, 2025
· · views
A live coding session adding a file-manager panel to a custom text editor, showcasing practical C programming, UI integration, and iterative development workflo...
video · Added on July 24, 2025
· 16:16 · 75.3K views
A guided derivation and proof-oriented explanation of A* search and related path-finding algorithms used in mapping applications, emphasizing optimality and heu...
video · Added on July 24, 2025
· 2:50:14 · 322.9K views
A case study demonstrating how refactoring large, complex codebases into simpler designs can yield order-of-magnitude speedups, with detailed profiling and opti...
video · Added on July 24, 2025
· 39:02 · 15.6K views
A practitioner talk on adopting the Zig programming language in real-world production environments, covering build system details, tooling, deployment lessons, ...
video · Added on July 24, 2025
· 27:01 · 27.9K views
Technical lecture showing how to accelerate text parsing by leveraging SIMD instructions, delving into low-level CPU mechanics, data alignment, and practical co...
video · Added on July 24, 2025
· · views
Lightning talk on C++ undefined behavior, illustrating how compilers leverage UB for optimization, why certain constructs are risky, and what developers need to...
video · Added on July 24, 2025
· 48:27 · 16.5K views
Conference talk introducing data-oriented programming in Java 21, outlining its four core principles and showing how records, sealed classes, and pattern matchi...
video · Added on July 24, 2025
· 39:39 · 42.4K views
A detailed tutorial that walks through binary loading, stack and heap operation, and the exact in-memory layout of Rust primitives, structs, enums, and smart po...
video · Added on July 24, 2025
· 18:23 · 327.5K views
An educational deep-dive into the source coding theorem, entropy, arithmetic coding, and asymmetric numeral systems, illustrating how these information-theoreti...
video · Added on July 24, 2025
· · views
Introductory talk on Rust’s ownership, borrowing, and lifetime system, demonstrating how the language enforces safe memory access and prevents common errors suc...
video · Added on July 24, 2025
· 1:50:14 · 139.2K views
Chandler Carruth explains how modern C++ compilers perform optimization passes, inlining, and code generation, helping developers write code that the optimizer ...
video · Added on July 24, 2025
· · views
Walk-through of implementing a minimal malloc-style dynamic memory allocator in C, covering free lists, block headers, splitting/merging, and alignment concerns...
video · Added on July 24, 2025
· 59:44 · 185.2K views
CppCon talk illustrating how cache hierarchies, branch prediction, alignment, and SIMD influence C++ performance and providing guidelines for writing hardware-c...
video · Added on July 24, 2025
· 30:54 · 10.7K views
Demonstrates building and training a single-layer neural network entirely in x86-64 assembly language, covering forward pass, MSE loss, back-propagation, and lo...
video · Added on July 24, 2025
· 2:16:40 · 87.5K views
A wide-ranging interview with Casey Muratori focusing on lessons learned from decades of writing highly optimized code and cultivating a performance-aware progr...
video · Added on July 24, 2025
· · views
An in-depth critique of microservices as a software-architecture pattern, weighing their impact on developer productivity, maintainability, and long-term techni...
video · Added on July 24, 2025
· · views
University lecture surveys database scalability techniques—sharding, replication, consistency models, and fault tolerance—framing them within distributed-system...
video · Added on July 24, 2025
· 12:45 · 126.1K views
Introduces an early, concrete application of category theory via algebraic topology, illustrating how categorical constructs map to homotopy concepts within top...
video · Added on July 24, 2025
· 20:18 · 635.9K views
Explains the mechanics and trade-offs of modern generative models, contrasting autoregressive transformer pipelines with denoising diffusion processes and detai...
video · Added on July 24, 2025
· · views
Harvard-trained psychiatrist Dr. K gives practical strategies for sustaining focus and combating algorithm-driven distractions, emphasizing behavioral routines ...
video · Added on July 24, 2025
· 10:43 · 109.5K views
Practical C++ demonstration of how cache locality and branch prediction affect real-world runtime, showcasing code patterns and optimizations to exploit modern ...
video · Added on July 24, 2025
· · views
Detailed instructional walkthrough of the OSI and TCP/IP networking models, explaining each layer’s protocols and responsibilities for practitioners learning co...
video · Added on July 24, 2025
· · views
Extended conversation with Casey Muratori analysing AMD’s upcoming Zen 5 CPU micro-architecture and how hardware design choices (caches, branch predictors, inst...
video · Added on July 24, 2025
· 46:22 · 118.5K views
François Chollet’s AGI-24 keynote critiques current LLM capabilities, uses ARC benchmark results to expose compositional reasoning gaps, and proposes integratin...
video · Added on July 24, 2025
· 30:26 · 3.4K views
Daily livestream coding session that incrementally implements core kernel functionality—memory management, drivers, and boot code—while explaining practical OS ...
video · Added on July 24, 2025
· 52:01 · 55.6K views
Stephen Wolfram’s keynote explores the broad “computational paradigm” as a unifying lens across physics, technology, AI, biology and mathematics—an ideas-driven...
video · Added on July 24, 2025
· 11:01 · 69.4K views
Survey-style overview of several famous open mathematical conjectures related to calculus and analysis, outlining what is known and why they remain unsolved.
video · Added on July 24, 2025
· 1:23:09 · 2.1K views
A live coding / debugging session with the TigerBeetle team that digs into tracing, measurement and architectural choices to diagnose the latency bottlenecks in...
video · Added on July 24, 2025
· 24:40 · 2.9K views
RustConf 2024 talk demonstrating how developers can leverage Rust’s compiler diagnostics, type system, and tooling to iteratively write correct, idiomatic code—...
video · Added on July 24, 2025
· 3:23:23 · 9.6K views
Long-form industry analysis show covering semiconductor manufacturing roadmaps, AMD’s 2 nm “Venice” chiplets, yield calculations, HBM4, CHIPS Act developments, ...
video · Added on July 24, 2025
· 2:06:52 · 55.4K views
Terry A. Davis live-codes and explains portions of the TempleOS compiler and associated OS internals, providing firsthand systems-programming insight into a hom...
video · Added on July 24, 2025
· 18:18 · 67.6K views
Walk-through of why GPU programming lacks the portability and toolchain simplicity enjoyed by CPUs, covering driver fragmentation, vendor-specific shading langu...
video · Added on July 24, 2025
· 26:14 · 30.5K views
An example-driven primer on categorical limits, building from sets and vector spaces to equalisers, fibre products, cones, and universal properties, aimed at ne...
video · Added on July 24, 2025
· 25:26 · 626 views
The first lecture in a sheaf-theory series, defining presheaf stalks, sheafification, and exactness concepts such as kernels and images within a categorical fra...
video · Added on July 24, 2025
· 44:23 · 935.2K views
Provides a rapid, physics-motivated introduction to geometric algebra, covering multivectors, grades, geometric products, and rotors as an extension of linear-a...
video · Added on July 24, 2025
· 1:16:05 · 1.4K views
Live demonstration of the Lean interactive theorem prover, showing how formal logic rules are encoded, manipulated, and verified, and discussing its role in mat...
video · Added on July 24, 2025
· 17:57 · 201.6K views
Explains the P vs NP problem by reducing arbitrary algorithms to SAT circuits, illustrating NP-completeness, reversibility, and implications for cryptography.
bookmark · Added on July 24, 2025
· 32 min read
bookmark · Added on July 24, 2025
· 52 min read
bookmark · Added on July 24, 2025
· 1 min read
This document describes the custom floating-point formats supported by the Tenstorrent Wormhole B0 architecture. These formats deviate from IEEE 754 standards in specific ways to optimize for AI workl
bookmark · Added on July 23, 2025
· 1 min read
Distillation means training a model to imitate another model's outputs. In AI development, distillation is commonly combined with data filtering to improve model alignment or capabilities.
video · Added on July 22, 2025
· 12:39 · 751.0K views
Clear, in-depth explanation of B-trees, a fundamental data structure used in databases and file systems.
video · Added on July 22, 2025
· · views
Podcast episode delving into C, low-level programming, debugging, tooling, and security topics—valuable insights for systems programmers.
video · Added on July 22, 2025
· 2:04:25 · 32.1K views
In-depth discussion with two language designers on compiler construction and programming-language paradigms—relevant, expert-level content.
video · Added on July 22, 2025
· 23:54 · 147.7K views
Clear technical explanation of the groundbreaking MIP*=RE complexity-theory result—valuable foundational content for theoretical computer scientists.
video · Added on July 22, 2025
· 1:32:51 · 3.7K views
Lecture on CUDA fundamental optimizations provides specialized technical guidance for high-performance GPU computing.
video · Added on July 22, 2025
· 47:46 · 53.6K views
Conference keynote by Zig creator Andrew Kelley on lowering barriers to systems programming is directly relevant to professional development in low-level softwa...
video · Added on July 22, 2025
· 14:22 · 131.3K views
Discussion clip where Jonathan Blow outlines principles for operating-system design offers conceptual insights relevant to systems programmers.
video · Added on July 22, 2025
· 39:38 · 10.6K views
Step-by-step series on writing a compiler and interpreter in Rust is a valuable, in-depth educational resource on compiler construction.
video · Added on July 22, 2025
· 1:59:25 · 54.6K views
Live-coding session showing how to port code from Zig to the experimental Jai language provides practical insight into systems-level programming and language de...
video · Added on July 22, 2025
· 21:06 · 18.8K views
Technical presentation of the rope data structure with code overview, useful for efficient string manipulation knowledge.
video · Added on July 22, 2025
· · views
Concise educational video demystifying lexers, a core component of compiler design.
video · Added on July 22, 2025
· 16:57 · 539.0K views
Clear explanation of how memory storage works at the transistor level, valuable for understanding computer architecture fundamentals.
video · Added on July 22, 2025
· · views
Live-coded walkthrough of building an interactive Bézier-curve editor with HTML Canvas, offering practical graphics and web-development techniques.
video · Added on July 22, 2025
· · views
Introductory episode of a series on CUDA/GPU programming, providing resources and setting up the technical context for massively-parallel computing.
video · Added on July 22, 2025
· 25:13 · 8.2K views
Sasha Rush delivers practical estimation techniques for Transformer/LLM models, beneficial for ML researchers and practitioners.
video · Added on July 22, 2025
· 57:20 · 14.0K views
Technical presentation explaining Zig’s build system (build.zig) to C++ engineers; practical content for build and tooling specialists.
video · Added on July 22, 2025
· · views
PLDI 2024 research talk introducing a new safe GPU programming language; highly relevant to programming-language researchers and GPU systems developers.
video · Added on July 22, 2025
· 8:24 · 22.8K views
Provides a detailed walkthrough of the Zig 0.13 release notes, covering compiler, std-lib and build-system changes—useful for programmers using or evaluating Zi...
video · Added on July 22, 2025
· · views
Introductory tutorial for the Zig programming language, providing practical setup instructions and code examples, directly relevant to software engineering educ...
video · Added on July 22, 2025
· 1:01:21 · 7.2K views
In-depth discussion on designing a programming language around its AST, covering type systems, effects, and implementation details—valuable for language and com...
video · Added on July 22, 2025
· 22:30 · 287.1K views
Explains the algorithmic technique of dynamic programming and is relevant for software engineers preparing for technical interviews or improving algorithmic ski...
video · Added on July 22, 2025
· 1:26:27 · 12.7K views
Conference talk previewing upcoming C++26 language features—directly relevant technical content for software engineers.
video · Added on July 22, 2025
· 1:47:49 · 75.8K views
In-depth talk on arena allocators and lifetime management, offering practical memory-management strategies for systems programmers.
video · Added on July 22, 2025
· 1:00:46 · 251.1K views
CppCon lecture presenting data-oriented design versus OOP with concrete performance case studies—highly relevant to C++ practitioners.
video · Added on July 22, 2025
· 37:40 · 21.6K views
Conference talk by a well-known expert examining lesser-known Java behaviors and pitfalls—valuable for practicing developers.
video · Added on July 22, 2025
· · views
Hands-on demonstration of error payload patterns in the Zig programming language; solid low-level programming tutorial content.
video · Added on July 22, 2025
· 57:46 · 241.8K views
CppCon lecture systematically covering all 105 STL algorithms, a high-value educational resource for C++ developers.
video · Added on July 22, 2025
· 10:03 · 368.0K views
Explains a remote-code-execution exploit and provides source code, offering valuable security and reverse-engineering insights.
video · Added on July 22, 2025
· 2:40:29 · 5.2K views
Long-form technical interview with the creator of the C3 programming language covering language design and the speaker’s engineering career.
video · Added on July 22, 2025
· 34:06 · 5.6K views
Conference talk detailing a novel fast linking approach for the Roc language, directly relevant to compilers/linkers and build performance.
video · Added on July 22, 2025
· 1:00:07 · 307.6K views
CppCon conference talk delivering in-depth techniques for ultra-low-latency C++ systems, directly relevant to performance-critical software engineering.
video · Added on July 22, 2025
· · views
Detailed case study of Lichess’s architecture and engineering practices (Scala, MongoDB, Snabbdom, bare-metal hosting) offering insights into high-impact solo d...
video · Added on July 22, 2025
· 22:42 · 1.5M views
High-quality educational lecture on how transformers store factual information, directly relevant to AI interpretability.
video · Added on July 22, 2025
· 59:23 · 89.7K views
Provides an in-depth mathematical explanation of tensors, suitable for learners of linear algebra and theoretical physics.
video · Added on July 22, 2025
· 24:47 · 145 views
Technical summary of a current arXiv paper on large-scale model merging, providing up-to-date insights for ML practitioners.
video · Added on July 22, 2025
· · views
Hands-on tutorial showing how to implement a software renderer in C, relevant to graphics programming and low-level optimization.
video · Added on July 22, 2025
· 1:15:14 · 95.6K views
Bartosz Milewski provides an intensive introduction to category theory with programming examples, fitting both educational and theoretical criteria.
video · Added on July 22, 2025
· 1:03:14 · 71.0K views
Conference-style talk by Erik Meijer connecting category theory to interface-based design and Java 8 lambdas; valuable for programmers interested in theoretical...
video · Added on July 22, 2025
· 39:27 · 35.0K views
Interview with the Zig language creator discussing compiler-integrated linting and tooling—practical insights for language and tooling enthusiasts.
video · Added on July 22, 2025
· 41:40 · 68.7K views
Philip Wadler’s lecture introduces category theory concepts for programmers, bridging mathematics and software development.
video · Added on July 22, 2025
· 59:42 · 36.7K views
CppCon talk focused on everyday performance techniques in modern C++, directly useful for software engineers concerned with optimization.
video · Added on July 22, 2025
· 19:24 · 21.1K views
Technical presentation linking Rust’s type system with type theory and proof techniques, valuable for language theorists and systems programmers.
video · Added on July 22, 2025
· 41:35 · 8.0K views
Academic talk from the Simons Institute presenting a unified framework for efficient linear layers in Transformers—highly relevant to deep-learning researchers ...
video · Added on July 22, 2025
· 47:55 · 15.1K views
Technical presentation on building a type system from scratch, relevant to language and compiler enthusiasts.
video · Added on July 22, 2025
· 50:52 · 96.2K views
Conference talk by Philip Wadler connecting category theory to programming; foundational material for programmers interested in type theory.
video · Added on July 22, 2025
· 1:52:46 · 127.3K views
Extended, in-depth interview with AI researcher Joscha Bach covering advanced AI architectures, cognition, and regulatory issues—valuable for AI and cognitive s...
video · Added on July 22, 2025
· 1:08:59 · 19.6K views
Stanford CS 229S lecture on large-scale inference math and AI megaclusters—direct, advanced technical content useful to ML researchers and engineers.
video · Added on July 22, 2025
· · views
Educational explanation of imaginary (complex) numbers aimed at engineering students, providing foundational mathematical knowledge.
video · Added on July 22, 2025
· 54:23 · 107.2K views
Panel discussion with leading scientists on how AI accelerates scientific discovery; offers strategic and technical perspectives on AI applications in research.
video · Added on July 22, 2025
· 25:16 · 14.9K views
Keynote outlining the technical and organizational roadmap of the Rust language—important insight for systems programmers and language researchers.
video · Added on July 22, 2025
· 1:45:37 · 28.0K views
In-depth lecture by AI researcher Joscha Bach on philosophical and cognitive aspects of AI, valuable for understanding conceptual foundations and ethics.
video · Added on July 22, 2025
· 1:10:34 · 1.0K views
Academic tutorial on computational models of visual attention with hands-on MATLAB code; directly relevant for researchers in computational neuroscience and AI.
video · Added on July 22, 2025
· 1:36:54 · 71.8K views
Filmed at the March 11, 2014 LispNYC meetup at Meetup HQ in NYC.
ABOUT DATA COUNCIL:
Data Council (https://www.datacouncil.ai/) is a community and conference ...
video · Added on July 22, 2025
· 1:06:12 · 15.4K views
This is a gentle introduction to applied category theory – more about the history of the subject, what people are trying to do, and my own personal involvement ...
video · Added on July 22, 2025
· 1:01:07 · 38.7K views
Description:
Category theory and its applications
Slides:
No Slides
Uploader: LambdaConf
Duration: 3667s
Views: 38695
video · Added on July 22, 2025
· 20:28 · 7.7K views
_Note: This video contains hardcoded subtitles. For the most accurate English video transcription, please enable YouTube subtitles and select "English" under Yo...
video · Added on July 22, 2025
· · views
Lesson 1 is concerned with defining the category of Abstract Sets and Arbitrary Mappings. We also define our first Limit and Co-Limit: The Terminal Object, and ...
video · Added on July 22, 2025
· 46:24 · 30.1K views
Video lectures at MIT. See http://brendanfong.com/programmingcats.html
Lecturers: Brendan Fong, Bartosz Milewski, David Spivak
Summary: In this course we e...
video · Added on July 22, 2025
· 1:28:00 · 6.3K views
https://cppnorth.ca/
---
C++ Memory Model: from C++11 to C++23
In the realm of C++ development, threading and memory management play a crucial role in crafti...
video · Added on July 22, 2025
· 1:08:49 · 14.5K views
Strong static typing detects a lot of bugs at compile time, so why would anyone prefer to program in JavaScript or Python? The main reason is that type systems ...
video · Added on July 22, 2025
· · views
Matrix multiplication on a GPU using CUDA C/C++.
Code Repository: https://github.com/tgautam03/xGeMM
Video Notes and Code Explainers: https://0mean1sigma.com...
video · Added on July 22, 2025
· 46:44 · 16.3K views
Dans cet entretien, le mathématicien Etienne Ghys interroge ses pairs Terence Tao, Nalini Anantharaman et Timothy Gowers autour de la question de la preuve dans...
video · Added on July 22, 2025
· 27:41 · 3.4K views
A talk by Joshua Liebow-Feeser (Software Engineer, Fuchsia Security, Google) at RustConf 2024 in Montreal, Canada & online on September 12. Hosted by the Rust F...
video · Added on July 22, 2025
· 27:37 · 5.2K views
To view a version of this talk without hardcoded subtitles, click here: https://www.youtube.com/watch?v=qd3x5MCUrhw
_Note: This video contains hardcoded subtit...
video · Added on July 22, 2025
· · views
✘ Links:
How I learned vim: https://amzn.to/3ONVG5R
How I learned C programming: https://amzn.to/41pvkhU
My keyboard: https://amzn.to/3Vudsih
Patreon: https://w...
video · Added on July 22, 2025
· · views
As I mentioned in the video, here are the links to the three math books that changed my life for the better:
1) Peter Selby and Steven L. Slavin's "Practical A...
video · Added on July 22, 2025
· 22:06 · 54.0K views
When you hear that someone is "studying algebra". What comes to mind?
Are they drilling through thousands of factorisation problems?
Are they an undergraduate s...
video · Added on July 22, 2025
· 25:49 · 43.2K views
To fully utilise the exciting category theory we've learnt so far, we need a way to abstract definitions from a specific category and then be able to apply them...
video · Added on July 22, 2025
· 29:05 · 191.9K views
In this video, I describe how all of the different theorems of multivariable calculus (the Fundamental Theorem of Line Integrals, Green's Theorem, Stokes' Theor...
video · Added on July 22, 2025
· · views
Appears to be a university operating-systems course recap; likely summarizes key OS concepts for the term, which is relevant academic material despite minimal d...
video · Added on July 22, 2025
· · views
Links to everything discussed in the video:
https://www.youtube.com/watch?v=tD5NrevFtbU
https://www.computerenhance.com/
https://www.computerenhance.com/p/clea...
video · Added on July 22, 2025
· 2:20:44 · 100.1K views
Although recorded as a live Twitch stream, it is a hands-on coding session on low-level TCP/network programming with references to code and technical articles, ...
video · Added on July 22, 2025
· 31:38 · 5.6K views
This talk will explore the world of memory allocation and its impact on application performance. Memory allocators are an often overlooked topic but are the bac...
video · Added on July 22, 2025
· 19:24 · 36.4K views
By first outlining a mathematically rigorous definition of a category, we can embark on a fascinating journey through category theory with examples from mathema...
video · Added on July 22, 2025
· 41:28 · 3.4K views
What did category theory ever do for us (functional programmers)? - An extreme pragmatic and un-academic approach. Examples are in Scala.
Talk given at Scale b...
video · Added on July 22, 2025
· · views
✘ Get Source Code and Early Video Access on Patreon:
https://www.patreon.com/c/HirschDaniel
✘ Learn to Code:
https://app.codecrafters.io/join?via=danieldeer
C...
video · Added on July 22, 2025
· 28:49 · 4.0K views
Category theory is a framework that unifies all of mathematics in an abstract and homogeneous language by extracting the essence of mathematical structures. The...
video · Added on July 22, 2025
· · views
PLAY SKLIME HERE: https://store.steampowered.com/app/1380970/Sklime__A_Difficult_Climbing_Adventure/
Acerola's color palette generator: https://evannorton.gith...
video · Added on July 22, 2025
· 1:15:45 · 170.9K views
http://CppCon.org
—
Presentation Slides, PDFs, Source Code and other presenter materials are available at: https://github.com/CppCon/CppCon2017
—
In 2012, Matt ...
video · Added on July 22, 2025
· 1:06:39 · 132.3K views
A summary of why Jane Street uses OCaml, including a discussion of how OCaml fits into the broader space of programming languages. Given to our summer interns....
video · Added on July 22, 2025
· 32:55 · 283.1K views
"The most dangerous thought you can have as a creative person is to think you know what you're doing."
Presented at Dropbox's DBX conference on July 9, 2013.
Al...
video · Added on July 22, 2025
· 12:19 · 6.4K views
PL Virtual Meetup: https://www.meetup.com/Programming-Languages-Toronto-Meetup/
CtFP Textbook: https://github.com/hmemcpy/milewski-ctfp-pdf
Github Repo: https:/...
video · Added on July 22, 2025
· 1:00:04 · 6.4K views
Conversation with an engineer about designing a functional UI framework in OCaml, covering architecture decisions and related tooling—fits technical-educational...
video · Added on July 22, 2025
· 1:27:34 · 8.3K views
http://cppnow.org
—
Presentation Slides, PDFs, Source Code and other presenter materials are available at: http://cppnow.org/history/2019/talks/
—
Composition i...
video · Added on July 22, 2025
· 13:19 · 61.6K views
Joscha Bach puts forward his radical theory of cyber animism.
Can the natural world be understood in terms of software agents?
Watch the full talk at https://...
video · Added on July 22, 2025
· 22:24 · 68.9K views
Modern CPUs manage to speed up even the simplest code, Matt Godbolt explains how there's a lot of juggling going on even in the simple use of registers.
Compu...
video · Added on July 22, 2025
· 1:13:45 · 9.5K views
In this video I give an overview of things I've learned after using Odin professionally on the same project for a year.
Uploader: Rickard Andersson (gonz)
Durat...
video · Added on July 22, 2025
· 14:28 · 79.7K views
In this video I will share something that will change the way you think about mathematics forever. If you learn this one thing, you can then go on and learn mor...
video · Added on July 22, 2025
· 3:48:10 · 27.3K views
Extended Q&A by a respected Rust educator covering numerous technical questions, development practices, and career advice—useful educational resource.
video · Added on July 22, 2025
· · views
P.S. I am looking for a job! If you can help me I would be very grateful. Message me on LinkedIn: www.linkedin.com/in/marc-maliar/
Uploader: Marc Maliar
Duratio...
video · Added on July 22, 2025
· 35:19 · 392.3K views
Clay (short for C Layout) is a high performance 2D UI layout library.
See the website at
https://nicbarker.com/clay
for more info, or check out the github repo...
video · Added on July 22, 2025
· · views
The provided audio is an explanation of sets in mathematics. It details ways to define sets, including roster notation, semantic descriptions, and set-builder n...
video · Added on July 22, 2025
· 16:09 · 1.2M views
Patreon: https://www.patreon.com/ahoy
Merch: https://ahoy-shop.fourthwall.com/
00:00 Introduction
01:04 Chris Sawyer's Early Career
03:56 Transition to the PC
...
video · Added on July 22, 2025
· · views
Euler's proof combines calculus and combinatorics in a remarkable way to prove that the primes never end! Euler's proof also reveals a deep connection between t...
video · Added on July 22, 2025
· 45:27 · 27.6K views
In this Lambda World 2019 keynote, Emily Riehl discusses category theory and computational effects.
Slides are available here: http://www.math.jhu.edu/~eriehl...
video · Added on July 22, 2025
· 16:50 · 62.2K views
In this video, we explore basic concepts of Measure Theory and the Lebesgue Integral. We will learn about important theorems of Lebesgue Integration like the Mo...
video · Added on July 22, 2025
· 18:32 · 1.4M views
There's a lot more to physics than F = ma! In this physics mini lesson, I'll introduce you to the Lagrangian and Hamiltonian formulations of mechanics. Get the ...
video · Added on July 22, 2025
· · views
This video is part of the “Real Analysis” series I am making.
Thanks and enjoy the video!
Real Analysis Playlist: https://www.youtube.com/playlist?list=PLDidd...
video · Added on July 22, 2025
· 1:26:47 · 85.8K views
François Chollet discusses the outcomes of the ARC-AGI (Abstraction and Reasoning Corpus) Prize competition in 2024, where accuracy rose from 33% to 55.5% on a ...
video · Added on July 22, 2025
· 28:05 · 28.1K views
At an Anthropic Research Salon event in San Francisco, four of our researchers—Alex Tamkin, Jan Leike, Amanda Askell and Josh Batson—discussed alignment science...
video · Added on July 22, 2025
· 13:53 · 120.5K views
A peek into the world of Riemann surfaces, and how complex analysis is algebra in disguise. Secure your privacy with Surfshark! Enter coupon code ALEPH for an e...
video · Added on July 22, 2025
· 11:50 · 251.2K views
Algebraic geometry is often presented as the study of zeroes of polynomial equations. But it's really about something much deeper: the duality between abstract ...
video · Added on July 22, 2025
· 2:46:59 · 24.3K views
This time I'm joined by Ryan Fleury, who is working on RAD Debugger in Epic Game Tools (formerly RAD Game Tools). It's incredibly detailed, technical deep dive ...
video · Added on July 22, 2025
· 51:49 · 101.0K views
The co-founders of Anthropic discuss the past, present, and future of Anthropic. From left to right: Chris Olah, Jack Clark, Daniela Amodei, Sam McCandlish, Tom...
video · Added on July 22, 2025
· · views
If you're like me, you've had a Feynman phase. In this video I talk about outgrowing this phase, and why I still recommend everyone watch Feynman.
00:02 The Fe...
video · Added on July 22, 2025
· 1:27:21 · 10.6K views
Definition of category. Example categories. Isomorphisms and monomorphisms.
Uploader: Alex Simpson
Duration: 5241s
Views: 10578
video · Added on July 22, 2025
· 42:32 · 1.8M views
A whistle-stop tour of how computers work, from how silicon is used to make computer chips, perform arithmetic to how programs run and computer graphics are dis...
video · Added on July 22, 2025
· · views
Rust lets you do efficient reference-counted strings and dynamic arrays using Arc basically just as easily as their owning (and deep-cloning) equivalents, Strin...
video · Added on July 22, 2025
· 1:04:53 · 75.5K views
EECS Colloquium
Wednesday, September 18, 2019
306 Soda Hall (HP Auditorium)
4-5p
Captions available upon request
Uploader: UC Berkeley EECS
Duration: 3893s
Vi...
video · Added on July 22, 2025
· 55:58 · 38.9K views
Rust is increasingly used in high-stakes sectors where errors can have serious consequences. In fields such as healthcare, aerospace, defense, and finance, soft...
video · Added on July 22, 2025
· 15:55 · 883.3K views
STEMerch Store: https://stemerch.com/
Support the Channel: https://www.patreon.com/zachstar
PayPal(one time donation): https://www.paypal.me/ZachStarYT
Versión...
video · Added on July 22, 2025
· 3:04:30 · 101.4K views
Long-form discussion with Jonathan Blow focused on the design and status of the Jai programming language—valuable insight into language design and game-engine t...
video · Added on July 22, 2025
· 1:00:53 · 6.4K views
Playlist: https://www.youtube.com/playlist?list=PLOROtRhtegr7DmeMyFxfKxsljAVsAn_X4
What is category theory? In this lecture we introduce categories, which incl...
video · Added on July 22, 2025
· 34:40 · 212.7K views
A quick run through of some tips for programming in C that have helped with my productivity and enjoyment of the language.
Referenced in this video:
Anthony C...
video · Added on July 22, 2025
· 58:16 · 223.3K views
Kurt Gödel showed that mathematical thinking cannot be captured in a formal axiomatic reasoning system. What does this deep result mean in practice? What are th...
video · Added on July 22, 2025
· · views
Carl Gauss was a child prodigy who reinvented mathematics. Try https://brilliant.org/Newsthink/ for FREE for 30 days, and get 20% off your annual premium subscr...
video · Added on July 22, 2025
· 18:36 · 3.8M views
Sign up with brilliant and get 20% off your annual subscription: https://brilliant.org/ZachStar/
STEMerch Store: https://stemerch.com/
Support the Channel: htt...
video · Added on July 22, 2025
· 1:03:21 · 175.2K views
Over the last 10 years we've seen Machine Learning consume everything, from the tech industry to the Nobel Prize, and yes, even the ML acronym. This rise in ML ...
video · Added on July 22, 2025
· 29:45 · 20.6K views
This lecture is part of an online course on algebraic topology.
We define the fundamental group, calculate it for some easy examples (vector spaces and spheres...
video · Added on July 22, 2025
· 18:20 · 26.5K views
Goal.
Explaining basic concepts of algebraic topology in an intuitive way.
This time.
What is...homotopy? Or: The same shape!?
Disclaimer.
Nobody is perfec...
video · Added on July 22, 2025
· 27:16 · 28.4K views
In this Tech Talk, Tudor Brindus, a software engineer at Jane Street, shares his expertise on reducing jitter—deviations from mean input processing times—in low...
video · Added on July 22, 2025
· 49:54 · 8.3K views
2024 LLVM Developers' Meeting
https://llvm.org/devmtg/2024-10/
------
Rust ❤️ LLVM
Speaker: Nikita Popov
------
Slides: https://llvm.org/devmtg/2024-10/slides/k...
video · Added on July 22, 2025
· 1:03:41 · 146.2K views
Lecture 1 of Algebraic Topology course by Pierre Albin.
Uploader: Mat Neth
Duration: 3821s
Views: 146152
video · Added on July 22, 2025
· 25:04 · 46.9K views
My code: https://github.com/keyframe41/ParticleSimulation
Part 1 video: https://youtu.be/XL8B5nzNEOc
Finally this video is done. Still have more particle ideas ...
video · Added on July 22, 2025
· 26:52 · 455.5K views
—————SOURCES————————————————————————
Percolation – Béla Bollobás and Oliver Riordan
Cambridge University Press, New York, 2006.
Sixty Years of Percolation – H...
video · Added on July 22, 2025
· · views
Uploader: Struggling Grad Student
Duration: 1530s
Views: 606456
video · Added on July 22, 2025
· 37:47 · 3.9K views
The first in a two-lecture series on our recently-announced Categorical Deep Learning framework (categoricaldeeplearning.com), given as a lecture for the Geomet...
video · Added on July 22, 2025
· 14:31 · 50.6K views
#logic #prooftheory #modeltheory #goedel
Access exclusive content on Patreon: https://www.patreon.com/user?u=86649007
All the way at the foundations of mathe...
video · Added on July 22, 2025
· 36:46 · 6.4K views
This presentation was recorded at YOW! 2019. #GOTOcon #YOW
https://yowcon.com
Hillel Wayne - Author of Practical TLA+ @hillelwayne3236
RESOURCES
https://twit...
video · Added on July 22, 2025
· · views
To try everything Brilliant has to offer—free—for a full 30 days, visit https://brilliant.org/TheCherno . You’ll also get 20% off an annual premium subscription...
video · Added on July 22, 2025
· 19:55 · 4.3M views
An explanation of fractal dimension.
Help fund future projects: https://www.patreon.com/3blue1brown
An equally valuable form of support is to simply share some ...
video · Added on July 22, 2025
· · views
I made Soft body physics and a soft body tetris from scratch. This was supposed to take like 2 weeks and ended up taking like 6 weeks so pls enjoy.
Walaber Ent...
video · Added on July 22, 2025
· 7:53 · 28.2K views
We explore a counter-intuitive property of proof by induction, where one inequality can be proven by induction easily, but a seemingly easier to prove, weaker, ...
video · Added on July 22, 2025
· 59:44 · 256.8K views
Trading at light speed: designing low latency systems in C++ - David Gross - Meeting C++ 2022
Slides: https://slides.meetingcpp.com
Survey: https://survey.meeti...
video · Added on July 22, 2025
· 27:10 · 78.4K views
AI. DeepSeek. OpenAI. Tech competition.
Support me!
Donation and Support:
https://buymeacoffee.com/windspiritz
https://www.patreon.com/Awakening_Richard
Upl...
video · Added on July 22, 2025
· · views
In this video, we'll explore the complex but essential concept of memory allocation by taking it to the next dimension - 3D visualization. You'll witness how me...
video · Added on July 22, 2025
· 28:33 · 2.8M views
The Cosmic Distance Ladder, how we learned distances in the heavens.
P
Patreon supporters see early views of new videos: https://www.patreon.com/3blue1brown
Ar...
video · Added on July 22, 2025
· 34:53 · 105.6K views
Relatively speedy-to-access cache saves your computer having to trudge over to the RAM, but with multiple levels of cache memory, how does it all work?
Matt G...
video · Added on July 22, 2025
· · views
#latenightcoding #softwaredevelopment #coding
Check out byeAI here: https://byeai.dev
This is how I take notes entirely with Neovim (and a bunch of other tool...
video · Added on July 22, 2025
· 18:10 · 134.8K views
Focuses on techniques for high-performance C++ code, aligning with software optimization and best practices.
video · Added on July 22, 2025
· 1:48:19 · 16.0K views
Ramon, the creator of raylib library, joins me to discuss its C code and design! We talk about open source model of development as well! Join us!
https://www.r...
video · Added on July 22, 2025
· 40:33 · 87.9K views
Charlie Marsh is the founder of Astral, which develops uv, a next-generation Python package manager written in Rust. In this talk, Charlie details the unique ch...
video · Added on July 22, 2025
· 53:45 · 126.2K views
In this lecture from Sam Cohen’s 3rd year ‘Information Theory’ course, one of eight we are showing, Sam asks: how do we measure the amount of information we lea...
video · Added on July 22, 2025
· 51:26 · 14.8K views
Clement Bonnet discusses his novel approach to the ARC (Abstraction and Reasoning Corpus) challenge. Unlike approaches that rely on fine-tuning LLMs or generati...
video · Added on July 22, 2025
· 1:09:05 · 17.1K views
Daniel Franzen and Jan Disselhoff, the "ARChitects" are the official winners (with co-researcher David Hartmann) of the ARC Prize 2024. Filmed at Tufa Labs in Z...
video · Added on July 22, 2025
· 1:07:02 · 25.3K views
Sepp Hochreiter, the inventor of LSTM (Long Short-Term Memory) networks – a foundational technology in AI. Sepp discusses his journey, the origins of LSTM, and ...
video · Added on July 22, 2025
· 22:07 · 628.0K views
There's often a lot of emphasis in math on generalizing concepts beyond the domains where they were originally defined, but what are the limits of this process?...
video · Added on July 22, 2025
· · views
Claude Shannon is the father of Information Theory. Try https://brilliant.org/Newsthink/ for FREE for 30 days, and get 20% off your annual premium subscription....
video · Added on July 22, 2025
· 2:53:25 · 23.7K views
Jim Keller is a microprocessor engineer who has run the gauntlet of today’s leading tech companies during their peak performance years. He’s designed for Intel,...
video · Added on July 22, 2025
· 21:14 · 48.8K views
"If you think something is unsolvable it will not get solved. Solving problems is partly about believing you can solve everything and sometimes that means you ...
video · Added on July 22, 2025
· 2:13:10 · 6.6K views
Links:
https://david-vanderson.github.io/
Uploader: Zig SHOWTIME
Duration: 7990s
Views: 6579
video · Added on July 22, 2025
· 20:50 · 18.0K views
Continuous functions play a crucial role in various disciplines in math. We discuss the epsilon-delta criterion and formalize it in the programming language and...
video · Added on July 22, 2025
· 48:25 · 800.5K views
Doug McLean, retired Boeing Technical Fellow, discusses several examples of erroneous ways of looking at phenomena in aerodynamics, that have either taken hold ...
video · Added on July 22, 2025
· · views
Support this channel at:
https://buymeacoffee.com/simonoz
Code for animations and examples:
https://github.com/SzymonOzog/GPU_Programming
https://github.com/Sz...
video · Added on July 22, 2025
· 1:38:49 · 42.5K views
Research Scientist Hado van Hasselt covers policy algorithms that can learn policies directly and actor critic algorithms that combine value predictions for mor...
video · Added on July 22, 2025
· · views
I recreated Shazam's algorithm out of curiosity but mostly out of desperation. In this video, I explain how Shazam works and how I implemented the algorithm in ...
video · Added on July 22, 2025
· 2:21:04 · 12.4K views
MAKE HISTORY WITH US THIS SUMMER:
https://demystifysci.com/demysticon-2025
PATREON
https://www.patreon.com/c/demystifysci
PARADIGM DRIFT
https://demystifysci...
video · Added on July 22, 2025
· 18:09 · 680.7K views
In-depth analysis of a transformer variant (DeepSeek MLA) covering architecture, performance, and equations—highly relevant deep-learning material.
video · Added on July 22, 2025
· 13:00 · 12.1K views
Everyone wants AI hardware to test and develop on, and Tenstorrent is leading the way from the AI startups when it comes to development kits. This new round of ...
video · Added on July 22, 2025
· 42:48 · 21.8K views
Uploader: DACtv
Duration: 2568s
Views: 21759
video · Added on July 22, 2025
· · views
Clip from my interview with Jim Keller: https://www.youtube.com/watch?v=YOiXomG9FhE
Subscribe to the TechTechPotato main channel at: http://www.youtube.com/tec...
video · Added on July 22, 2025
· 39:42 · 2.9K views
Audio Noise Removal Version / 音声ノイズ除去バージョン
0925 The Future of RISC-V and RISC-V AI Jim Keller | CEO, Tenstorrent (Canada)
0925 RISC-VとAIとの未来 Jim Keller (ジム・...
video · Added on July 22, 2025
· 10:52 · 154.2K views
THE 70s MUST HAVE BEEN A WILD TIME TO BE ALIVE, right?
I often daydream about the lives of people who picked up C just at this perfect time, right at the start....
video · Added on July 22, 2025
· 10:31 · 64.6K views
Is category theory a mathematical theory? Or something more? In this brief presentation, educational designer Paul Dancstep shares an animated description of wh...
video · Added on July 22, 2025
· 9:18 · 13.2K views
Detailed code walk-through of a game engine in x64 assembly provides valuable low-level programming insight.
video · Added on July 22, 2025
· 19:51 · 407.6K views
Clear, well-structured explanation of zero-knowledge proofs with examples—solid educational cryptography content.
video · Added on July 22, 2025
· 4:12:10 · 32.1K views
Watch the first half of Day 1 of our RISC-V Technology Conference in Bangalore.
Opening Remarks from Minister Rajeev Chandrasekhar
Keynote from Tenstorrent Inc...
video · Added on July 22, 2025
· 1:01:16 · 99.9K views
Join The ACCU Membership For Exclusive Benefits, Discounts & Reduced Conference Ticket Pricing:
https://accu.org/menu-overviews/membership/
---
The Genius of R...
video · Added on July 22, 2025
· · views
Clip from my interview with Jim Keller and Ljubisa Bajic of Tenstorrent
https://www.youtube.com/watch?v=sMvudTBBQNw
Subscribe to the TechTechPotato main channe...
video · Added on July 22, 2025
· 23:00 · 2.9K views
Presented by Ljubisa Bajic, CEO and Lead Architect, and Stan Skokrac, VP Software, Tenstorrent
The machine learning field is red hot, and numerous teams have b...
video · Added on July 22, 2025
· · views
Clip discusses cognitive and organizational issues around code size, offering insight into software engineering practices and maintainability.
video · Added on July 22, 2025
· 56:23 · 117.5K views
Yann LeCun, Meta, gives the AMS Josiah Willard Gibbs Lecture at the 2025 Joint Mathematics Meetings on “Mathematical Obstacles on the Way to Human-Level AI.” Th...
video · Added on July 22, 2025
· 1:01:48 · 11.3K views
ACCU Membership: https://tinyurl.com/ydnfkcyn
Sponsored By think-cell: https://www.think-cell.com/accu
https://accu.org
Keynote: Development Environments Shape...
video · Added on July 22, 2025
· 1:30:46 · 1.6K views
ACCU Membership: https://tinyurl.com/ydnfkcyn
Join us for ACCU Conference 2025 - 1st-4th April - Online & in Bristol, UK
https://accuconference.org/
---
Not Yo...
video · Added on July 22, 2025
· 1:45:39 · 331.5K views
Hosted by the Acquired podcast, "Live at NVIDIA GTC With Acquired," this special pregame broadcast featured luminary speakers offering valuable insights into NV...
video · Added on July 22, 2025
· 51:12 · 494.4K views
As part of the Manufacturing@MIT Distinguished Speaker Series, Dr. Morris Chang SB ’52, SM ’53, ME ‘55, founder and former chairman and CEO of TSMC (Taiwan Semi...
video · Added on July 22, 2025
· 19:16 · 149.4K views
Links:
- Patreon (Support the channel directly!): https://www.patreon.com/Asianometry
- X: https://twitter.com/asianometry
- Bluesky: https://bsky.app/profile/a...
video · Added on July 22, 2025
· 23:01 · 376 views
In this insightful panel, pioneering architects behind CUDA—including Gregory Diamos, Davor Capalija, Micah Villmow, and Nicholas Wilt—explore the rise of NVID...
video · Added on July 22, 2025
· · views
In this video I look back on my told topology assignments from 2017 and tear them apart.
Submit your garbage proofs to:
ktheorytutoring@gmail.com
Uploader: K-T...
video · Added on July 22, 2025
· 21:31 · 16.0K views
https://arcprize.org/arc-agi#arc-agi-2
Play ARC-AGI: https://arcprize.org/play?task=1ae2feb7
ARC-AGI-2 was launched on March 24, 2025. This second edition in t...
video · Added on July 22, 2025
· 52:51 · 274 views
Abstract: When every big company such as Microsoft, Amazon, Google, Meta, Apple, ..., is rushing to buy Nvidia GPUs to train and run their AI models, who are go...
video · Added on July 22, 2025
· 29:59 · 20.5K views
Interesting stuff coming out of NVIDIA's event. The Analyst Q&A with Jensen was particularly eye-opening.
[00:00] Keynotes
[01:49] Does NVIDIA Build GPUs Anymo...
video · Added on July 22, 2025
· · views
AMD Core Innovation Summit May 2014
Jim Keller, the architect behind the original Athlon 64 and Athlon XP talks about the exciting new Zen architecture that has...
video · Added on July 22, 2025
· 43:28 · 135.6K views
This presentation was recorded at GOTO Chicago 2024. #GOTOcon #GOTOchgo
https://gotochgo.com
Matt Godbolt - Low-level Latency Geek @MattGodbolt
RESOURCES
htt...
video · Added on July 22, 2025
· 36:21 · 30.4K views
This presentation was recorded at YOW! Australia 2024. #GOTOcon #YOW
https://yowcon.com
Yan Chernikov - Director at Studio Cherno @TheCherno
RESOURCES
https:...
video · Added on July 22, 2025
· 32:51 · 4.5K views
On this episode of Approximately Correct, we talk about Richard S. Sutton's AI journey and with his peers about his recent Turing Award.
Want to learn from AI ...
video · Added on July 22, 2025
· 11:47 · 872.6K views
A team of amateurs recently came together in an online collaboration called the Busy Beaver Challenge to pin down the value of BB(5), the fifth "busy beaver" nu...
video · Added on July 22, 2025
· 59:25 · 1.9K views
Dhanya Sridhar (IVADO + Université de Montréal + Mila)
https://simons.berkeley.edu/talks/dhanya-sridhar-ivado-universite-de-montreal-mila-2025-04-16
Safety-Guar...
video · Added on July 22, 2025
· 55:56 · 25.2K views
Conference talk that surveys historical and emerging trends in programming languages, providing contextual and educational value for software engineers.
video · Added on July 22, 2025
· · views
Scaling beyond a single GPU can be challenging, but it's not as hard as you might think! Join us on a journey exploring multi-GPU libraries that simplify the pr...
video · Added on July 22, 2025
· · views
In this video I RIP in viewers GARBAGE proofs. If you want your proofs roasted, or you want 1-on-1 tutoring, please email me at
ktheorytutoring@gmail.com
Intr...
video · Added on July 22, 2025
· 40:05 · 8.7K views
The evolution of artificial intelligence has seen remarkable milestones, particularly in developing systems capable of advanced reasoning. This panel will explo...
video · Added on July 22, 2025
· · views
Support this channel at:
https://buymeacoffee.com/simonoz
Code for animations:
https://github.com/SzymonOzog/GPU_Programming
Code for kernels and benchmarks:
...
video · Added on July 22, 2025
· 19:36 · 182.1K views
Hypergraphs can have any number of dimensions. They can be 2-dimensional, 3-dimensional, 4.81-dimensional or, in the limit, ∞-dimensional.
So how does the thre...
video · Added on July 22, 2025
· 1:04:50 · 16.6K views
We discuss quantum computing, Turing machines, multiverses, consilience, interdisciplinarity, and the state of academia among other fascinating topics.
David w...
video · Added on July 22, 2025
· 1:29:06 · 49.9K views
I had the pleasure of sitting down with David Deutsch in his lovely garden in Oxford a few months ago. Here’s our conversation.
Support this podcast: http://bu...
video · Added on July 22, 2025
· 1:24:07 · 29.0K views
David Deutsch is the founder of the field of quantum computing and the author of The Beginning of Infinity and The Fabric of Reality.
Read me Contra David, on ...
video · Added on July 22, 2025
· 1:06:43 · 49.2K views
Speaker: Scott Aaronson, Department of Computer Science, University of Texas, Austin
Title: How Much Math Is Knowable?
Abstract: Theoretical computer science ...
video · Added on July 22, 2025
· · views
Time to go to the next level!
Uploader: Reclaiming Curiosity
Duration: 3695s
Views: 505
video · Added on July 22, 2025
· · views
James B. Keller is a microprocessor engineer best known for his work at AMD and Apple. He was the lead architect of the AMD K8 microarchitecture and was involve...
video · Added on July 22, 2025
· 1:24:57 · 29.3K views
In this episode of Unsupervised Learning, we sit down with Dylan Patel, Chief Analyst at SemiAnalysis, to break down what these sweeping changes really mean. Fr...
video · Added on July 22, 2025
· 1:34:22 · 7.5K views
A talk I gave to my MATS 8.0 training program on thinking models.
Thinking models seem like a really big deal! Why are they such an improvement? What does this...
video · Added on July 22, 2025
· 45:46 · 5.4K views
Uploader: Spatial ML Seminar Series
Duration: 2746s
Views: 5364
video · Added on July 22, 2025
· 42:43 · 136.7K views
The principle of Propositions as Types links logic to computation. At first sight it appears to be a simple coincidence---almost a pun---but it turns out to be ...
video · Added on July 22, 2025
· 36:54 · 1.9M views
Qubits, state vectors, and Grover's algorithm for search.
Instead of sponsored ad reads, these lessons are funded directly by viewers: https://3b1b.co/support
A...
video · Added on July 22, 2025
· 11:01 · 63.2K views
Concise technical analysis of a forthcoming ARMv9 CPU, covering micro-architectural features, packaging, and compiler strategy—highly relevant to computer-archi...
video · Added on July 22, 2025
· 10:19 · 360.6K views
Smooth lesson from my smooth brain about...
CHANNEL LINKS
🗞️ Substack — https://verynormal.substack.com
☕ Buy me a Ko-fi! — https://ko-fi.com/verynormal
USEFU...
video · Added on July 22, 2025
· 30:30 · 32.9K views
This video has a page on 0DE5 with exercises and resources
https://www.0de5.net/stimuli/a-reintroduction-to-programming/instructions-to-languages/grammars-parsi...
video · Added on July 22, 2025
· · views
TT-Forge is Tenstorrent’s MLIR-based compiler. Learn how TT-Forge integrates with our AI software stack, why we’re building on MLIR, and the features that make ...
video · Added on July 22, 2025
· 56:26 · 14.6K views
An in-depth look at Anthropic's Transformer Circuit Blog Post
Part 1 here: https://youtu.be/mU3g2YPKlsA
Discord here: https;//ykilcher.com/discord
https://tran...
video · Added on July 22, 2025
· 18:29 · 5.9K views
Current and future hardware requirements for next generation frontier models.
Recorded live in San Francisco at the AI Engineer World's Fair. See the full sche...
video · Added on July 22, 2025
· 1:07:54 · 8.6K views
If Charles Dickens was alive in 2024, A Tale of Two Cities might be the divide between the “GPU poor” and the “GPU rich”.
We mentioned these terms in some of o...
video · Added on July 22, 2025
· 1:27:38 · 15.5K views
Talk kindly contributed by Stephen Wolfram in SEMF's 2024 Interdisciplinary Summer School:
https://semf.org.es/school2024
TALK ABSTRACT
A talk on the recent d...
video · Added on July 22, 2025
· 2:13:11 · 3.1K views
Slides: https://drive.google.com/file/d/1pIVJDkohQUt1ZawQvzzR7Wi3cRmyE8Pa/view?usp=sharing
Uploader: GPU MODE
Duration: 7991s
Views: 3087
video · Added on July 22, 2025
· 38:04 · 4.3K views
This interview was recorded for the GOTO Book Club. #GOTOcon #GOTObookclub
http://gotopia.tech/bookclub
Read the full transcription of the interview here:
http...
video · Added on July 22, 2025
· · views
Full Course:
Developer Productivity, v2: https://www.frontendmasters.com/courses/developer-productivity-v2/?utm_source=youtube&utm_medium=course_link&utm_campai...
video · Added on July 22, 2025
· 34:38 · 68.4K views
In this experiment, I took a statement in universal algebra that a collaborator of mine (Bruno Le Floch) on the Equational Theories Project had written a one-pa...
video · Added on July 22, 2025
· 1:01:56 · 18.9K views
Visit our website: https://aprendemosjuntos.bbva.com/
Subscribe to our youtube channel: https://www.youtube.com/channel/UCI6Q...
Visit our website: https://apre...
video · Added on July 22, 2025
· 1:19:58 · 137.6K views
Uploader: Lex Fridman
Duration: 4798s
Views: 137552
video · Added on July 22, 2025
· · views
Please check out Numerai - our sponsor @
http://numer.ai/mlst
Patreon: https://www.patreon.com/mlst
Discord: https://discord.gg/ESrGqhf5CB
Professor Jürgen Sc...
video · Added on July 22, 2025
· 2:34:23 · 4.1K views
Note: Sorry about the video quality! When I'm properly coding I zoom in so it should be readable, but could be better.
A session with some of my MATS 8.0 train...
video · Added on July 22, 2025
· 2:27:14 · 2.2K views
Despite the informal Twitch-stream format, the content is a hands-on walkthrough of PCB design/fabrication, a valuable technical skill for hardware/embedded eng...
video · Added on July 22, 2025
· · views
Recorded 07 November 2024. Kevin Ellis of Cornell University presents "Probabilistic Thinking in Language and Code" at IPAM's Naturalistic Approaches to Artific...
video · Added on July 22, 2025
· 1:00:51 · 12.3K views
Edward Witten will consider the algebra of observables along the worldline of an observer as a background independent algebra in quantum gravity.
Uploader: Simo...
video · Added on July 22, 2025
· · views
Highlights from the Modular GPU Kernel Hackathon! 💥
Developers gathered at AGI House to build cutting-edge Mojo kernels on AMD Instinct™ MI300X, from LLM traini...
video · Added on July 22, 2025
· · views
When the universe branches, we branch with it.
Those branches don’t remain forever apart. They come back together.
So we, as conscious observers, are rescued ...
video · Added on July 22, 2025
· 27:03 · 3.0K views
In today's video, I wanted to cover context windows in the transformer's architecture and how to make them BIG.
# Table of Content
- Introduction: 0:00
- Why m...
video · Added on July 22, 2025
· 31:44 · 46.1K views
Following on from the previous video at https://www.youtube.com/watch?v=cyyR7j2ChCI, I now attempt to formalize a different proof of the same assertion using th...
video · Added on July 22, 2025
· 11:33 · 50.7K views
DON'T memorize this – understand it. Matrix multiplication isn’t just some weird set of rules. It's a carefully designed system that fits real-world problems pe...
video · Added on July 22, 2025
· 37:40 · 64.3K views
The tensor product of vector spaces (or modules over a ring) can be difficult to understand at first because it's not obvious how calculations can be done with ...
video · Added on July 22, 2025
· 11:40 · 71.7K views
In this video, we dive into the world of autoencoders, a fundamental concept in deep learning. You'll learn how autoencoders simplify complex data into essentia...
video · Added on July 22, 2025
· 1:56:32 · 9.6K views
In this video, Daniel Cumming a formal verification engineer at Runtime Verification and Rust instructor at RareSkills explains how the Rust compiler works unde...
video · Added on July 22, 2025
· 1:12:35 · 16.1K views
Stephen Wolfram is a prominent computer scientist and theoretical physicist, best known for developing Mathematica and authoring A New Kind of Science. Today, w...
video · Added on July 22, 2025
· 2:09:04 · 10.6K views
In this series we hunt for the backdoor that the NSA allegedly uses in order to crack AES encryption. The backdoor is inside of Intel (and AMD) CPUs and today w...
video · Added on July 22, 2025
· · views
Tiled (general) Matrix Multiplication from scratch in CUDA C.
Code Repo: https://github.com/tgautam03/CUDA-C/tree/master/05_tiled_mat_mul
Notes: https://0mean1...
video · Added on July 22, 2025
· · views
Watch me start writing an entirely new #programming #language and #codegen from scratch! You can support my work at: http://patreon.com/renerebe https://github....
video · Added on July 22, 2025
· · views
To try everything Brilliant has to offer—free—for a full 30 days, visit https://brilliant.org/FloatHeadPhysics . You’ll also get 20% off an annual premium subsc...
video · Added on July 22, 2025
· 11:13 · 5.1M views
AI Competes in a 100m Dash!
In this video 5 AI Warehouse agents compete to learn how to run 100m the fastest. The AI were trained using Deep Reinforcement Lear...
video · Added on July 22, 2025
· · views
❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambda.ai/papers
Guide for using DeepSeek on Lambda:
https://docs.lambdalabs.com/education/la...
video · Added on July 22, 2025
· · views
Conference talk detailing PyTorch backend architecture, custom ops, and performance tuning—highly relevant for deep-learning engineers and compiler enthusiasts.
video · Added on July 22, 2025
· 11:57 · 10.1K views
Uploader: NotImplemented
Duration: 717s
Views: 10124
video · Added on July 22, 2025
· 41:15 · 320.7K views
Course: https://www.udemy.com/course/introduction-to-power-system-analysis/?couponCode=KELVIN ✅
If you want to support me to make more frequent videos, consider...
video · Added on July 22, 2025
· · views
Panel discussion by experienced developers on the merits and pitfalls of Test-Driven Development provides practical insights and real-world anecdotes valuable t...
video · Added on July 22, 2025
· 59:48 · 118.8K views
This video was sponsored by Brilliant.
To try everything Brilliant has to offer—free—for a full 30 days, visit https://brilliant.org/Tariq10x/ . You’ll also get...
video · Added on July 22, 2025
· 23:16 · 18.5K views
In this video, I break down DeepSeek's Group Relative Policy Optimization (GRPO) from first principles, without assuming prior knowledge of Reinforcement Learni...
video · Added on July 22, 2025
· 15:06 · 772.1K views
I found a way to turn Portal 2 into a web server.
Thank you for watching!
This project is available on GitHub: https://github.com/p2r3/HTTPortal
Join our Disco...
video · Added on July 22, 2025
· 32:05 · 28.2K views
The first 500 people to use my link https://skl.sh/deepia05251 will get a 1 month free trial of Skillshare!
In this video you'll learn everything about the DDP...
video · Added on July 22, 2025
· 1:03:00 · 26.2K views
In-depth technical walkthrough of Transformer architecture by an AI researcher, directly aligned with deep-learning educational content.
video · Added on July 22, 2025
· 38:11 · 49.4K views
In this video we are looking at Diffusion Models from a different angle, namely through Score-Based Generative Models, which arguably can be considered as the b...
video · Added on July 22, 2025
· 34:48 · 1.2M views
Visit https://brilliant.org/Reducible/ to get started learning STEM for free, and the first 200 people will get 20% off their annual premium subscription.
Cha...
video · Added on July 22, 2025
· 13:25 · 24.6K views
Donate to Closer To Truth and help us keep our content free and without paywalls: https://shorturl.at/OnyRq
Does ordinary stuff have mysterious properties? Tak...
video · Added on July 22, 2025
· · views
SemiAnalysis' Dylan Patel presentatng on Multi-Datacenter Training at the Decentralized AI Day 2025 event hosted by Prime Intellect
Uploader: Prime Intellect AI...
video · Added on July 22, 2025
· 1:54:16 · 250.0K views
Leading physicist Raphael Bousso joins Brian Greene to explore the almost unreasonable capacity of our theories of gravity to give deep insights into quantum ph...
video · Added on July 22, 2025
· 26:23 · 276.5K views
Get 4 months extra on a 2 year plan here: https://nordvpn.com/artemkirsanov. It’s risk free with Nord’s 30 day money-back guarantee!
Socials:
X/Twitter: https...
video · Added on July 22, 2025
· 19:31 · 148.1K views
Head to https://squarespace.com/artem to save 10% off your first purchase of a website or domain using code ARTEM
Socials:
X/Twitter: https://x.com/ArtemKRSV
P...
video · Added on July 22, 2025
· · views
Despite the humorous style, this is a university lecture on foundational questions in physics delivered by a researcher, which fits the collection’s focus on te...
video · Added on July 22, 2025
· 11:16 · 110.8K views
It's been a little over 2 years since my last Oxidise your Command Line video, and so it's about time for an update!
Today I have 30 rust-powered command line ...
video · Added on July 22, 2025
· · views
Discord server: https://discord.gg/AqHbaeK43b
Donations: https://ko-fi.com/vimjoyer
Code from the video: https://github.com/vimjoyer/flake-starter-config
Also...
video · Added on July 22, 2025
· 28:28 · 940.4K views
ERRATA:
• The "Church-Turing Thesis" is different from the "Church-Turing Theorem". The "theorem" is the claim which I discussed in the video- namely, that the ...
video · Added on July 22, 2025
· 41:14 · 25.8K views
Come for an introduction to programming the GPU by the lead architect of CUDA. CUDA's unique in being a programming language designed and built hand-in-hand wit...
video · Added on July 22, 2025
· 6:01 · 40.9K views
CUDA Teaching Center
Oklahoma State University ECEN 4773/5793
Uploader: Josh Holloway
Duration: 361s
Views: 40934
video · Added on July 22, 2025
· 23:39 · 133.5K views
Comprehensive tutorial explaining Nix package manager, deterministic builds, and related tooling, valuable for devops and reproducible development.
video · Added on July 22, 2025
· 23:21 · 37.7K views
Andrej Karpathy shared the story of how he built llm.c and encouraged the audience to build more reference architectures that can fit in an LLMs context length....
video · Added on July 22, 2025
· 3:14:34 · 1.1M views
Terence Tao is widely considered to be one of the greatest mathematicians in history. He won the Fields Medal and the Breakthrough Prize in Mathematics, and has...
video · Added on July 22, 2025
· · views
Donate to Closer To Truth and help us keep our content free and without paywalls: https://shorturl.at/OnyRq
Emergence is how the world works differently at var...
video · Added on July 22, 2025
· 2:01:07 · 12.5K views
Nic Barker is a self-taught programmer who went from web development to building Clay, a fast UI layout library in C. We talk about how he got started, his jour...
video · Added on July 22, 2025
· 52:28 · 27.7K views
This is my entry to #SoME4, 3Blue1Brown's Summer of Math Exposition Competition!
Diffusion models are typically portrayed as models that learn to denoise a cor...
video · Added on July 22, 2025
· 1:01:04 · 29.8K views
Mitchell takes us through his AI workflow and we dive into a recent commit of his: https://github.com/ghostty-org/ghostty/commit/3de3f48faf830fe1326f44b08fb9f27...
video · Added on July 22, 2025
· · views
ICML 2024 Tutorial
"Machine Learning on Function spaces #NeuralOperators"
Abstract:
This tutorial will introduce neural operators, an extension of neural net...
video · Added on July 22, 2025
· · views
High-Frequency Trading System (HFT) are the bleeding edge of real-time systems — HFT architecture is designed for nanosecond-level execution, not just milliseco...
video · Added on July 22, 2025
· 31:11 · 124.6K views
Breakthrough Discuss is an annual academic conference focused on life in the Universe and novel ideas for space exploration. Breakthrough Discuss 2025: Life As ...
video · Added on July 22, 2025
· · views
Make a donation to Closer To Truth to help us continue exploring the world's deepest questions without the need for paywalls: https://shorturl.at/OnyRq
Conside...
video · Added on July 22, 2025
· 37:43 · 35.6K views
Open source has revolutionized software. Now it is hardware's turn. This talk will present today's chip design economics, introduce the free and open RISC-V ins...
video · Added on July 22, 2025
· · views
Make a donation to Closer To Truth to help us continue exploring the world's deepest questions without the need for paywalls: https://shorturl.at/OnyRq
“Power ...
video · Added on July 22, 2025
· 1:15:45 · 296.3K views
IT’S ALL ABOUT MATH!
An ongoing series hosted by The Department of Mathematics of the University of Toronto
How playing games led to more numbers than anybody ...
video · Added on July 22, 2025
· 2:26:03 · 24.6K views
Dr. Dr. h.c. Joscha Bach (*1973 Weimar, DDR) Humboldt-Universität, Forschung am MIT und Harvard, Kognitionswissenschaftler, KI-Forscher und Philosoph. Bach ist...
video · Added on July 22, 2025
· 23:33 · 88.7K views
To try everything Brilliant has to offer—free—for a full 30 days, visit https://brilliant.org/ArtemKirsanov . You’ll also get 20% off an annual premium subscrip...
video · Added on July 22, 2025
· 31:11 · 14.5K views
at Napoleonic hall of Brera palace
Uploader: Università degli Studi dell'Insubria
Duration: 1871s
Views: 14453
video · Added on July 22, 2025
· · views
80,000 Hours want to help you find a fulfilling career that makes a positive difference in the world: https://80000Hours.org/bensyversen
How a numerical table ...
video · Added on July 22, 2025
· · views
Von Neumann’s Game Theory: How Strategic Thinking Changed Business, Politics, and Global Power
Welcome to History with BMResearch. In this video, we explore ho...
video · Added on July 22, 2025
· 41:42 · 2.2K views
*tl;dr: a programming language for documents*
James shares a bit about Typst as an excited newcomer to the 'documents as code' ecosystem, while simultaneously ...
video · Added on July 22, 2025
· 24:23 · 56.6K views
Diffusion models are a key innovation with far-reaching impacts on multiple fields in machine learning, being the technology behind OpenAI's DALL-E and Sora, Go...
video · Added on July 22, 2025
· · views
Go to https://www.pcbway.com for all your CNC machining needs, and make the hard parts simple.
A long overdue upgrade to the homemade CNC milling machine, amon...
video · Added on July 22, 2025
· 47:52 · 2.3M views
Let's try to convince a bunch of particles to behave (at least somewhat) like water.
Written in C# and HLSL, and running inside the Unity engine.
Source code:
...
video · Added on July 22, 2025
· 31:52 · 3.3K views
Song Han, Associate Professor, MIT Electrical Engineering and Computer Science, on accelerating large language model and generative AI.
Han’s talk was part of ...
video · Added on July 22, 2025
· · views
This video is part of the Machine Learning series taught by Prof. Hamprecht at Heidelberg University during the winter term 2024/2025.
Lecture Structure:
00:00...
video · Added on July 22, 2025
· · views
Speaker: Yuka Ikarashi
Uploader: GPU MODE
Duration: 4115s
Views: 1072
video · Added on July 22, 2025
· 28:20 · 320 views
Uploader: GOSIM Foundation
Duration: 1700s
Views: 320
video · Added on July 22, 2025
· · views
How the deformation mapping and the deformation gradient are used to mathematically describe deformation - with many visual examples.
This is the first video o...
video · Added on July 22, 2025
· 1:14:48 · 1.6K views
Speaker, institute & title
1) Alex Alberts, Purdue University, Information field theory for solving Bayesian inverse problems
Uploader: CRUNCH Group: Home of M...
video · Added on July 22, 2025
· · views
Goal.
I would like to tell you a bit about my favorite theorems, ideas or concepts in mathematics and why I like them so much.
This time.
What is...homotopy o...
video · Added on July 22, 2025
· 9:58 · 65.0K views
Remove your personal information from the web at https://JoinDeleteMe.com/BYCLOUD and use
code BYCLOUD for 20% off🙌
In this video, we take a look at this re...
video · Added on July 22, 2025
· 58:52 · 6.1K views
Dr. Karl Friston, University College London, applies the free energy principle to set forth an account of life, or self-organization, in terms of active inferen...
video · Added on July 22, 2025
· 28:10 · 4.5K views
Gordon Moore, co-founder of Intel Corporation, to whom the term "Moore's Law" is attributed, speaks about the ubiquitous microchip. Series: "Frontiers of Knowle...
video · Added on July 22, 2025
· 53:06 · 27.1K views
Fireside Chat With Ilya Sutskever and Jensen Huang AI Today and Vision of the Future March 2023
Uploader: Jason MJ (MJ)
Duration: 3186s
Views: 27119
video · Added on July 22, 2025
· 59:11 · 98.0K views
In this Presidential Lecture, Terence Tao will survey historical and recent developments in the use of machines in mathematics. He will also speculate on the fu...
video · Added on July 22, 2025
· 1:35:36 · 399 views
Conference talk presenting advanced C++ design techniques for declarative programming, directly relevant to professional software development.
video · Added on July 22, 2025
· 1:18:13 · 7.6K views
Our last AI PhD grad student feature was Shunyu Yao, who happened to focus on Language Agents for his thesis and immediately went to work on them for OpenAI. Ou...
video · Added on July 22, 2025
· · views
In this video, i'll show you how you would go about writing a terminal emulator from scratch in C, as it's hard to find similar content to this :)
You can get...
video · Added on July 22, 2025
· 1:02:17 · 214.2K views
Dylan Patel breaks down the current chaos inside the world’s top AI companies. Dylan is the founder and CEO of SemiAnalysis, one of the best analyst firms cover...
video · Added on July 22, 2025
· 2:53:59 · 10.6K views
A CIMC Salon at the Internet Archive in San Francisco
Uploader: California Institute for Machine Consciousness
Duration: 10439s
Views: 10649
video · Added on July 22, 2025
· 1:49:02 · 7.8K views
University lecture by cognitive scientist Joscha Bach examining AI architecture and machine consciousness; fits educational and technical focus on cognition and...
video · Added on July 22, 2025
· · views
This is a visualization of all the neurons of a neural network as it is trained to learn 4 periods of the sine function.
The neural network has 14 layers. The i...
video · Added on July 22, 2025
· · views
Developer devlog that dives into algorithmic and performance considerations for simulating large numbers of NPCs, providing practical insights into game-engine ...
video · Added on July 22, 2025
· 39:23 · 75.5K views
Abstract:
I will try to share a glimpse of this strange unification of many different ideas. This talk is aimed at a general audience, and no particular backgro...
video · Added on July 22, 2025
· 46:57 · 13.7K views
Here are some of Devine's disjointed thoughts on the creation of a clean-slate computing stack based on the universal virtual machine strategy for digital prese...
video · Added on July 22, 2025
· 1:02:36 · 4.3K views
This is a technical presentation about diffusion language models, a relatively new approach to text generation that differs fundamentally from traditional autor...
video · Added on July 22, 2025
· 1:13:00 · 100.9K views
#LambdaConf2025 took place in Estes Park Colorado this past May 12th and 13th.
UPCOMING EVENT:
The Ultimate Coder
Casting call: https://docs.google.com/forms/d...
video · Added on July 22, 2025
· 21:37 · 64.4K views
To try everything Brilliant has to offer—free—for a full 30 days, visit https://brilliant.org/ArtemKirsanov/ .
You’ll also get 20% off an annual premium subscri...
video · Added on July 22, 2025
· 40:14 · 27.9K views
Type theory is one of the central ideas in theoretical computer science and formal linguistics. But what is it, where did it come from, and how does it work? We...
video · Added on July 22, 2025
· 43:25 · 1.4K views
In this episode targeted at beginners, we show the end-to-end application development process, starting from an empty directory. We'll consider package configur...
video · Added on July 22, 2025
· 50:14 · 17.5K views
Algebraic geometry seminar
Department of Pure Mathematics
University of Waterloo
September 22nd, 2016
Following the notes of Ravi Vakil, available at http://mat...
video · Added on July 22, 2025
· 34:01 · 2.8K views
www.pydata.org
Diving into Transformer Model Internals
While everybody and their dog is building applications on generative AI, the inner workings of tran...
video · Added on July 22, 2025
· · views
If this question bothers you, I believe you need to learn to work with your brain, not against it.
Uploader: Math-life balance
Duration: 191s
Views: 3152
video · Added on July 22, 2025
· 54:25 · 1.2K views
[C++ Under the Sea 2024 conference]
https://cppunderthesea.nl/
11th of October 2024
Video recording sponsored by think-cell: https://www.think-cell.com
[T...
video · Added on July 22, 2025
· 8:15 · 25.6K views
Clear technical explanation of FPGA architecture and its use in low-latency HFT systems, fitting the hardware and systems design focus.
video · Added on July 22, 2025
· 1:34:57 · 12.4K views
Episode #4 of Yesterday:
Dylan Patel is the founder of SemiAnalysis, the leading research and consulting firm for AI infrastructure and buildouts. Their public...
video · Added on July 22, 2025
· 1:09:29 · 1.0K views
Speaker: Joe Fioti
Uploader: GPU MODE
Duration: 4169s
Views: 1039
video · Added on July 22, 2025
· 54:34 · 16.5K views
In this session, I will explore some playful low-power, sometimes analog, computation systems and esoteric programming languages, designed to work offline, on s...
video · Added on July 22, 2025
· · views
To try everything Brilliant has to offer—free—for a full 30 days, visit https://brilliant.org/GalLahat/ . You’ll also get 20% off an annual premium subscription...
video · Added on July 22, 2025
· 44:14 · 14.4K views
Jordan Wolfe sits down with Dylan Patel, Founder and Chief Analyst at SemiAnalysis, a research and consulting firm specializing in semiconductor and other AI-in...
video · Added on July 22, 2025
· 1:10:36 · 79.3K views
Go to http://piavpn.com/torscabinet to get 83% off from our sponsor Private Internet Access with 4 months free! Featuring technology invented by Bell Labs, like...
video · Added on July 22, 2025
· 40:43 · 16.5K views
This presentation was recorded at GOTO Aarhus 2023. #GOTOcon #GOTOaar
https://gotoaarhus.com
Magnus Madsen - Assistant Professor at Aarhus University
RESOURCE...
video · Added on July 22, 2025
· 17:13 · 23.7K views
The purpose of monads and their alternatives (old and new). For people who love monads, hate monads, and those who don’t get them.
Make FP click by joining “Ho...
video · Added on July 22, 2025
· 1:22:51 · 18.0K views
Kit Langton explores algebraic effects, emphasizing separating syntax from semantics. Kit demonstrates this through the Kyo library, discussing syntax and seman...
video · Added on July 22, 2025
· 36:14 · 311.7K views
Links:
- Patreon (Support the channel directly!): https://www.patreon.com/Asianometry
- X: https://twitter.com/asianometry
- Bluesky: https://bsky.app/profile/a...
video · Added on July 22, 2025
· 22:01 · 729.5K views
Links:
- The Asianometry Newsletter: https://www.asianometry.com
- Patreon: https://www.patreon.com/Asianometry
- Threads: https://www.threads.net/@asianometry
...
video · Added on July 22, 2025
· 16:58 · 170.4K views
How a small Belgian town became the center of semicondcutor innovation. From High-NA EUV to silicon photonics. A closer look at imec.
Become a supporter on Pa...
video · Added on July 22, 2025
· 18:57 · 3.6M views
In 1986, the Soviet Union had slightly more than 10,000 computers. The Americans had 1.3 million.
At the time of Stalin's death, the Soviet Union was the worl...
video · Added on July 22, 2025
· 13:32 · 19.0K views
I'm sure we've all heard of the Yoneda Lemma before, but why does it get so much hype in category theory when the result itself is quite elementary to prove?
W...
video · Added on July 22, 2025
· 34:34 · 91.1K views
Links:
- Patreon (Support the channel directly!): https://www.patreon.com/Asianometry
- X: https://twitter.com/asianometry
- Bluesky: https://bsky.app/profile/a...
video · Added on July 22, 2025
· 2:27:33 · 271.0K views
Casey Muratori's talk at BSC 2025.
Casey's links:
- https://ComputerEnhance.com/
- https://x.com/cmuratori/
BSC links:
- https://BetterSoftwareConference.com/...
video · Added on July 22, 2025
· 23:42 · 417.0K views
This video’s sponsor Brilliant is a great way to learn more. You can try Brilliant for free for thirty days by visiting https://brilliant.org/WelchLabs and the...
video · Added on July 22, 2025
· 8:42 · 26.0K views
In this video, we generalize Euclidean vector space to obtain Hilbert spaces. In the process, we come across Bessel's inequality and Parseval's identity. The th...
video · Added on July 22, 2025
· 23:57 · 61.7K views
In this video, we aim to explore the full geometric meaning of the second derivative test through the lens of linear algebra. We'll uncover the power of the spe...
video · Added on July 22, 2025
· 3:00:21 · 5.6M views
Michael Levin is a biologist at Tufts University working on novel ways to understand and control complex pattern formation in biological systems. Please support...
video · Added on July 22, 2025
· 47:51 · 14.6K views
In-depth review of a recent research paper on Energy-Based Transformers, offering technical insights into advanced deep-learning architectures.
video · Added on July 22, 2025
· · views
Will your favorite roguelikes still be playable in 20 years? How about 50? The Uxn/Varvara ecosystem is a radically simple, portable, and personal approach to c...
video · Added on July 22, 2025
· 27:14 · 2.1M views
Jacobian matrix and determinant are very important in multivariable calculus, but to understand them, we first need to rethink what derivatives and integrals me...
video · Added on July 22, 2025
· 1:07:08 · 26.8K views
Conference talk detailing techniques for parallelizing a physics solver; highly relevant to concurrency and performance optimization in software engineering.
video · Added on July 22, 2025
· 14:44 · 523.9K views
To learn more about various areas of Group Theory: https://en.wikipedia.org/wiki/Group_theory
Galois Theory article in Encyclopedia of Mathematics: https://en...
video · Added on July 22, 2025
· 29:08 · 1.1K views
In this episode we dive into Pauli Spinors, Weyl Spinors, Dirac Spinors, and Majorana Spinors. We talk about their difference and later we compare them to twis...
video · Added on July 22, 2025
· 1:08:23 · 24.0K views
Advait Shinde discusses the history of the theory of computation, delving into axiomatic thinking, Peano axioms, Turing Machines, Lambda Calculus, the Y Combina...
video · Added on July 22, 2025
· 54:16 · 15.1K views
The Lambda Calculus is a tiny symbol manipulation system which suffices to compute anything Turing-computable. Thanks to this expressive power, LC is woven into...
video · Added on July 22, 2025
· 2:10:52 · 3.7K views
The great Stephen Wolfram spends 2 hours discussing the ruliads and computational irreducibility at the Real World Risk Institute Summer School.
Uploader: N N T...
video · Added on July 22, 2025
· 21:01 · 126.5K views
Visual, high-level explanation of scaled dot-product attention and why it enables large language models to capture long-range dependencies.
video · Added on July 22, 2025
· · views
This video has a list of books, videos, and exercises that goes through the undergrad pure mathematics curriculum from start to finish.
---
REAL ANALYSIS
Boo...
video · Added on July 22, 2025
· · views
SUBSCRIBE CHANNEL: https://bit.ly/AIInsightNews
-----------------
OpenAI has released Stable Code 3B, a new Large Language Model (LLM) designed for code complet...
video · Added on July 22, 2025
· 54:43 · 1.2M views
Historical deep-dive into the MP944 F-14 Central Air Data Computer and its claim as the first microprocessor, including architecture and programming model.
video · Added on July 22, 2025
· 31:17 · 1.0M views
Explains how CPUs implement hardware timers, covering divider chains, counter registers, interrupts, and their use in precise timing control.
video · Added on July 22, 2025
· 36:55 · 2.0M views
Andrew Ng outlines current AI trends, enterprise adoption patterns, and startup opportunities with an emphasis on data-centric supervised learning.
video · Added on July 22, 2025
· · views
As the race to build the world's first truly useful quantum computer intensifies, so too does the need for clear-eyed assessment. This Field Notes episode bring...
video · Added on July 22, 2025
· 1:45:56 · 479.7K views
Donald Knuth reflects on algorithmic analysis, computational complexity, and insights from The Art of Computer Programming.
video · Added on July 22, 2025
· 58:01 · 38.7K views
Tutorial distills LLM behavior into five key formulas—perplexity, attention, GEMM efficiency, scaling laws, and RASP reasoning.
video · Added on July 22, 2025
· · views
Everything you need to know about using Apple Vision Pro
Vision Pro Review: https://youtu.be/86Gy035z_KA
Apple's Forbidden words: https://youtu.be/kvN5_GXlg2Y?...
video · Added on July 22, 2025
· 33:28 · 2.1M views
Comprehensive survey of quantum computing principles, algorithms, physical qubit implementations, and engineering challenges in the field.
video · Added on July 22, 2025
· 1:28:22 · 2.2M views
Introduces the quantum computing model to computer scientists, deriving qubits, gates, and Deutsch’s algorithm using linear-algebra formalisms.
video · Added on July 22, 2025
· · views
Detailed walkthrough of a developer’s 2024 hardware and Linux software setup, offering practical insights into tooling and workflow valuable to software enginee...
video · Added on July 22, 2025
· 59:01 · 8.1K views
Presents the Assembly Hypothesis—a rigorous mathematical model for emergent learning and computation in cortical neural assemblies.
video · Added on July 22, 2025
· 1:11:40 · 874.4K views
Andrej Karpathy kicks off Stanford CS25 with a primer on Transformer architecture, its history, and cross-domain applications.
video · Added on July 22, 2025
· 31:14 · 4.0M views
3Blue1Brown visually proves and contextualizes the Central Limit Theorem and its importance in probability and data analysis.
video · Added on July 22, 2025
· 31:48 · 1.7M views
Animated overview of the Standard Model, explaining fundamental particles, forces, symmetries, and open questions in modern particle physics.
video · Added on July 22, 2025
· 56:26 · 5.5K views
Research talk on Dense Associative Memory networks, exploring high-capacity energy-based models for pattern storage and retrieval.
video · Added on July 22, 2025
· 9:15 · 13.1K views
Clarifies entropy as a measure of information—not disorder—linking thermodynamic and Shannon definitions via microstate counting.
video · Added on July 22, 2025
· 1:24:07 · 492.9K views
Stephen Wolfram discusses his computational approach to the second law of thermodynamics, entropy growth, and implications for AI governance.
video · Added on July 22, 2025
· 59:05 · 3.9K views
Talk explores reciprocal advances between neuroscience and AI, highlighting how brain insights inform interpretable machine-learning models.
video · Added on July 22, 2025
· 1:00:42 · 2.7K views
Google DeepMind’s Douglas Eck surveys state-of-the-art generative AI systems for music, video, and images, detailing model architectures and datasets.
video · Added on July 22, 2025
· 1:00:06 · 1.1K views
Seminar proves the self-regularizing sparsity of nonparametric MLEs for mixture models, yielding logarithmic bounds on component count.
video · Added on July 22, 2025
· 2:20:15 · 344.4K views
Wide-ranging conversation with Po-Shen Loh about competitive mathematics, combinatorics, and effective strategies for learning and teaching math.
video · Added on July 22, 2025
· · views
Very simple approach for AI architectures with Symbols , learning devices, learnable properties, perception, brain, ANN, SNN...
#machine_learning #deep_learnin...
video · Added on July 22, 2025
· 1:15:04 · 694.6K views
Live coding stream showing workflow, tooling, and patch submission process for Linux kernel maintenance and security back-porting.
video · Added on July 22, 2025
· 16:24 · 140.5K views
Technical overview of memristor technology and its role in power-efficient analog in-memory computing for AI accelerators.
video · Added on July 22, 2025
· 21:43 · 3.6M views
Recreates Feynman’s geometric proof of Keplerian orbits, demonstrating why planetary motion forms ellipses using elementary mechanics.
video · Added on July 22, 2025
· 2:00:34 · 2.8M views
Feature-length documentary exposing the production pipeline, tools, and design decisions behind Naughty Dog’s The Last of Us Part II.
video · Added on July 22, 2025
· · views
Marc Raibert is founder and former long-time CEO of Boston Dynamics, and recently Executive Director of the newly-created Boston Dynamics AI Institute. Please s...
video · Added on July 22, 2025
· 18:03 · 187.2K views
Explores why linear algebra underpins diverse scientific computations, illustrating its unifying power across optimization and physics.
video · Added on July 22, 2025
· · views
Support the channel: https://ko-fi.com/jkzero
Story of how Planck discovered the blackbody radiation formula and why he introduced energy quantization as a math...
video · Added on July 22, 2025
· 13:00 · 1.7M views
Explains how DeepMind’s AlphaTensor discovered record-breaking tensor-decomposition algorithms that speed up matrix multiplication.
video · Added on July 22, 2025
· 1:12:30 · 180.4K views
Jeff Dean reviews recent algorithmic and hardware advances enabling Gemini-class multimodal LLMs and highlights scientific applications.
video · Added on July 22, 2025
· 50:03 · 47.5K views
Paper walk-through of V-JEPA, detailing a predictive video representation model trained without labels for downstream vision tasks.
video · Added on July 22, 2025
· 2:13:34 · 860.3K views
Andrej Karpathy codes a GPT Byte-Pair-Encoding tokenizer from scratch, dissecting Unicode handling and frequency-based merges.
video · Added on July 22, 2025
· 17:06 · 100.2K views
Concise primer on LoRA and QLoRA, showing how low-rank adapters enable parameter-efficient fine-tuning of Transformer models under quantization.
video · Added on July 22, 2025
· · views
30 AI Projects You Can Build This Weekend: https://the-data-entrepreneurs.kit.com/30-ai-projects
This is the 6th video in a series on using large language mode...
video · Added on July 22, 2025
· 15:10 · 5.1M views
Visual geometric interpretation of Bayes’ theorem, illustrating belief updates and prior-posterior relationships in probabilistic inference.
video · Added on July 22, 2025
· 31:53 · 20.6K views
Keynote analyzes the limits of Moore’s Law scaling and advocates mixed-signal and ADC-centric approaches for power-efficient RF/digital design.
video · Added on July 22, 2025
· 10:50 · 10.1K views
Step-by-step visual and PyTorch implementation of the Transformer—covering self-attention, positional encoding, and multi-head mechanisms.
video · Added on July 22, 2025
· 19:56 · 1.2M views
Engineering overview of ITER, detailing magnetic confinement, cryogenics, vacuum vessel construction, and the challenges of large-scale fusion reactors.
video · Added on July 22, 2025
· · views
Get 30% off unlimited access to Ground News, giving you full coverage of breaking news and allowing you to navigate media bias seamlessly 👉 https://www.ground.n...
video · Added on July 22, 2025
· 1:00:35 · 1.3M views
Introductory Stanford lecture surveys brain anatomy, neuronal signaling, and pharmacology, setting foundations for understanding human cognition.
video · Added on July 22, 2025
· 19:22 · 987.6K views
Demonstrates how consistent heuristics transform Dijkstra into the A* algorithm, with proofs, implementation tips, and geographic path-finding demos.
video · Added on July 22, 2025
· · views
🔗 Links 🔗
The Era of 1-bit LLMs:
All Large Language Models are in 1.58 Bits
https://arxiv.org/pdf/2402.17764.pdf
BitNet: Scaling 1-bit Transformers for
Large ...
video · Added on July 22, 2025
· 1:11:45 · 23.3K views
Deep technical interview on Groq’s Language Processing Unit architecture—single-cycle SIMD fabric, compiler stack, and network scaling versus GPUs.
video · Added on July 22, 2025
· 51:15 · 477.1K views
Talk uses geometric algebra to show why no unique general vector-vector product exists in 3-D, highlighting dot, cross, and outer products.
video · Added on July 22, 2025
· · views
Recorded 29 February 2024. Sitan Chen of Harvard University presents "Provably learning a multi-head attention layer" at IPAM's EnCORE Workshop on Computational...
video · Added on July 22, 2025
· · views
Stephen Wolfram joins Brian Greene to explore whether the ultimate theory of the universe might emerge from a computationally simple framework.
This program is...
video · Added on July 22, 2025
· 8:20 · 4.7K views
Hands-on Jupyter notebook walks line-by-line through performing LoRA fine-tuning of a large language model using HuggingFace PEFT.
video · Added on July 22, 2025
· · views
Highlights from the #intel Foundry Direct Connect keynote presentation, featuring Sam Altman from #openai as well as the CEOs of Microsoft ( #msft stock ) and A...
video · Added on July 22, 2025
· · views
Magnetic Braking Looks Like Magic.
See the full video here: https://youtu.be/zU3niMdjegQ
#shorts
Uploader: Action Lab Shorts
Duration: 60s
Views: 17635921
video · Added on July 22, 2025
· 28:46 · 41.0K views
Illustrated guide to Stable Diffusion explaining latent-diffusion training, CLIP text encoders, and reverse-diffusion image generation.
video · Added on July 22, 2025
· 16:56 · 309.4K views
Computerphile coding session builds and tweaks Stable Diffusion models in Python/Colab, clarifying sampler parameters and latent spaces.
video · Added on July 22, 2025
· · views
The full report (PDF): http://math.fau.edu/yiu/Oldwebsites/MPS2010/TerenceTao1984.pdf
Terence did note in his answers that questions 6 and 8 (A & E) at 2:18 can...
video · Added on July 22, 2025
· 33:00 · 2.6M views
Connects quantum/molecular dynamics to macroscopic Navier-Stokes behavior, laying theoretical groundwork for building a CFD simulator.
video · Added on July 22, 2025
· · views
Visit our website: https://datascience.harvard.edu
WATCH IN STANDARD FORMAT: https://youtu.be/k9DnQPrfJQs
One year after the release of GPT-4, large language m...
video · Added on July 22, 2025
· · views
A look at the textbook that math genius Ramanujan read when he was 16, Synopsis of Pure Mathematics is a book by G. S. Carr. This video was sponsored by Brillia...
video · Added on July 22, 2025
· · views
(*) Among current mathematicians, many people regard Professor Terence Tao as the world's finest... Opinions on such things vary, of course.
Professor Tao kindl...
video · Added on July 22, 2025
· 1:56:20 · 6.0M views
End-to-end coding tutorial constructs a minimal GPT Transformer—including dataset, BPE tokenizer, self-attention, and training loop—from scratch.
video · Added on July 22, 2025
· 10:16 · 417.9K views
Shows how simple inter-particle attraction/repulsion rules give rise to complex “Particle Life” behaviors, with open-source code for experimentation.
video · Added on July 22, 2025
· · views
Join CodeCrafters and learn by creating your own: Redis, Git, Http server, Interpreter, Grep... in your favorite programming language:
https://app.codecrafters....
video · Added on July 22, 2025
· 2:53:17 · 79.3K views
Long-form hacking session explores building a toy language front-end targeting the QBE backend, illustrating SSA IR, parsing, and register allocation.
video · Added on July 22, 2025
· 1:46:04 · 32.7K views
George Hotz refactors tinygrad’s linearizer, exposing low-level tensor compiler optimizations that map high-level ops to efficient GPU kernels.
video · Added on July 22, 2025
· 49:13 · 3.1M views
Details how a 6502 CPU fetches and decodes machine instructions, covering control lines, micro-sequencing, and timing states at the transistor level.
video · Added on July 22, 2025
· 17:56 · 449.0K views
The Busy Beaver game, pointless? Or a lesson in the problems of computability? - How do you decide if something can be computed or not?
Professor Brailsford's ...
video · Added on July 22, 2025
· 12:34 · 97.3K views
The Anthropic prompt is to help you create better prompts
MetaPrompt Colab: https://drp.li/8Odmv
Meta Prompt colab Original: https://colab.research.google.com/...
video · Added on July 22, 2025
· · views
Live-coded tutorial on implementing float support in a C scripting language, directly relevant to compilers and systems programming.
video · Added on July 22, 2025
· · views
Euler's Number, e, is one of the most prominent constants in mathematics and exponential functions are some of the most important in maths. In this video: we ta...
video · Added on July 22, 2025
· 3:13:13 · 181.2K views
Had so much fun chatting with my good friends Trenton Bricken and Sholto Douglas on the podcast.
No way to summarize it, except:
This is the best context dum...
video · Added on July 22, 2025
· 13:40 · 387.6K views
Andrew Ng, founder of DeepLearning.AI and AI Fund, speaks at Sequoia Capital's AI Ascent about what's next for AI agentic workflows and their potential to signi...
video · Added on July 22, 2025
· 1:11:34 · 113.7K views
Bioelectric networks as targets for regenerative medicine
Uploader: AIMSS - AI in Medical Systems Society
Duration: 4294s
Views: 113690
video · Added on July 22, 2025
· 26:20 · 483.9K views
Remember when I used a video with a coconut in the thumbnail to drive a stake through the heart of mathematical structure? Today, in this introduction to the ba...
video · Added on July 22, 2025
· 56:32 · 93.3K views
Alexander Borst, Max-Planck-Institute for Biological Intelligence, Martinsried, Germany
Abstract: Detecting the direction of image motion is important for visu...
video · Added on July 22, 2025
· 31:13 · 1.2M views
This is the most information-dense introduction to group theory you'll see on this website. If you're a computer scientist like me and have always wondered what...
video · Added on July 22, 2025
· 40:08 · 767.5K views
Shortform link:
https://shortform.com/artem
In this video we will talk about backpropagation – an algorithm powering the entire field of machine learning and ...
video · Added on July 22, 2025
· 1:08:49 · 31.9K views
Playlist: https://www.youtube.com/playlist?list=PLOROtRhtegr7DmeMyFxfKxsljAVsAn_X4
When are two shapes the "same"? Topics covered include deformation retract, ...
video · Added on July 22, 2025
· · views
Before tmux, I spent most of my time working in graphical editors and interfaces, and found working in the terminal difficult to do.
After moving to tmux and l...
video · Added on July 22, 2025
· · views
The number one question I get is to explain how I set up my development environment.
So, I decided to do a video on one of the key components of my setup. NVCh...
video · Added on July 22, 2025
· · views
Neovim is perhaps the best editor in my opinion. When set up correctly, it can empower you to be productive, especially when writing python code.
I want to hel...
video · Added on July 22, 2025
· 10:40 · 372.5K views
Entropy, Cross-Entropy and KL-Divergence are often used in Machine Learning, in particular for training classifiers. In this short video, you will understand wh...
video · Added on July 22, 2025
· · views
Full Episode: https://youtu.be/UTuuTTnjxMQ
Website & Transcript: https://www.dwarkeshpatel.com/p/sholto-douglas-trenton-bricken
Spotify: https://open.spotify.c...
video · Added on July 22, 2025
· 1:17:59 · 12.1K views
The Power of Counting Arguments (Cadence Labs Distinguished Speaker Series, 16 January 2001)
Downloaded from
http://www.cs.utexas.edu/users/EWD/video-audio/vide...
video · Added on July 22, 2025
· 31:12 · 5.5K views
This is the second half of the Hardware SPI video that would have been far too long. As such the beginning jumps strait into implementation, I usually try and s...
video · Added on July 22, 2025
· · views
The Zero to ASIC Course covers everything you need to design your own chips.
Sign up to the newsletter - https://www.zerotoasiccourse.com/newsletter
Uploader: ...
video · Added on July 22, 2025
· 42:24 · 6.6K views
We dive into the Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution paper, a technique, competitive with GPT-2, that can use diffusio...
video · Added on July 22, 2025
· · views
In this video I talk about how to self-study mathematics. Do you have advice for people learning mathematics on their own? If so, please leave a comment below.
...
bookmark · Added on July 21, 2025
· 5 min read
(See also the ESF lecture series webpage for information about the plenary meetings. It is more likely to be current.) Week 1: September 24-28. Reading: Education and Discipline (Russell); Meno (Pl…
bookmark · Added on July 21, 2025
· 28 min read
Homepage of the Goomba AI Lab @ CMU MLD.
bookmark · Added on July 21, 2025
· 17 min read
Homepage of the Goomba AI Lab @ CMU MLD.
bookmark · Added on July 21, 2025
· 14 min read
Neural networks scale they way they do, purely because of data.
bookmark · Added on July 21, 2025
· 3h 38m read
How to think like a classical statistical-mechanic, with many examples from gas theory, biology, probability, and information theory. Prerequisites: thermodynamics, calculus, probability, and mathematical maturity.
bookmark · Added on July 21, 2025
· 2h 0m read
My basic framework for understanding the modern Chinese economy and its growth. Includes brief histories of Chinese, Soviet, and Japanese economy. Framework applied to two case studies on the stock market and the housing market.
bookmark · Added on July 21, 2025
· 3h 17m read
What every graduate student should know about analytical mechanics, delivered with economic style and many illustrations. Particular focus on particle-wave duality and old quantum theory. Prerequisites: multivariate calculus and mathematical maturity.
bookmark · Added on July 21, 2025
· 6 min read
Why does Adam with aggressive gradient value/norm clipping have sparse updates and do well with higher learning rates? Here we show that it is essentially equivalent to a smoothed version of SignSGD/NormSGD.
bookmark · Added on July 18, 2025
· 20 min read
Historically I have worked on many projects related to high-performance Protobuf, be that on the C++ runtime, on the Rust runtime, or on integrating UPB, the fastest Protobuf runtime, written by my colleague Josh Haberman.
bookmark · Added on July 16, 2025
· 3 min read
Goedel-Prover-V2: The Strongest Open-Source Theorem Prover to Date
bookmark · Added on July 16, 2025
· 2 min read
Single-producer/single-consumer (SPSC) queues are essential for achieving high throughput in real-time applications like telemetry ingestion, game loops, and...
bookmark · Added on July 15, 2025
· 12 min read
Some thoughts on API design with GATs from implementing a generic picker library
bookmark · Added on July 15, 2025
· 12 min read
Eleven init systems enter, one init system leaves.
bookmark · Added on July 15, 2025
· 5 min read
mildbyte.xyz
bookmark · Added on July 15, 2025
· 16 min read
In the face of disruptive technologies, moats created by closed source are temporary. Even OpenAI’s closed source approach can’t prevent others from catching up.
bookmark · Added on July 15, 2025
· 1 min read
Vertex Block Descent is a fast physics-based simulation method that is unconditionally stable, highly parallelizable, and capable of converging to the implicit Euler solution.
bookmark · Added on July 15, 2025
· 17 min read
The Oberon System[3] is a modular, single-user, single-process, multitasking operating system written in the programming language Oberon.
bookmark · Added on July 15, 2025
· 3h 17m read
What every graduate student should know about analytical mechanics, delivered with economic style and many illustrations. Particular focus on particle-wave duality and old quantum theory. Prerequisites: multivariate calculus and mathematical maturity.
bookmark · Added on July 15, 2025
· 41 min read
A list of key concepts for building and testing reliable distributed systems, with basic definitions and deep references.
bookmark · Added on July 14, 2025
· 19 min read
During the 2020 Covid lockdowns, the privately owned realms of the internet became the obligatory venues for public life.
bookmark · Added on July 10, 2025
· 3 min read
Lifted from Mathoverflow:
I think (almost) everyone agrees that Hartshorne's Algebraic Geometry is still the best.
Then what might be the 2nd best? It can be a book, preprint, online lecture note,
bookmark · Added on July 9, 2025
· 5 min read
Learn how to optimize your app for Apple silicon with two new hardware-assisted tools in Instruments. We'll start by covering how to...
bookmark · Added on July 9, 2025
· 13 min read
SemiAnalysis is hiring an analyst in New York City for Core Research, our world class research product for the finance industry. Please apply here It’s been a bit over 150 days since the launc…
bookmark · Added on July 9, 2025
· 1 min read
Federico Faggin is one of the greatest luminaries of high technology alive today. A physicist by education, he is the inventor of the microprocessor and the MOS silicon gate technology, both of which underlie the modern world's entire information technology. With the knowledge and experience of a li
bookmark · Added on July 9, 2025
· 5 min read
Philosophical Foundations of Neuroscience : Bennett, M. R., Hacker, P. M. S.: Amazon.co.uk: Books
bookmark · Added on July 9, 2025
· 5 min read
The Oxide Computer Company job application process1 asks applicants to answer a set of personal questions about their career and experiences.
bookmark · Added on July 9, 2025
· 7 min read
On a whim, I picked up a copy of the book The Creative Act: A Way of Being at First Light Books in Austin, Texas.
bookmark · Added on July 9, 2025
· 5 min read
Orca operates on a base of 36 increments. Operators using numeric values will typically also operate on letters and convert them into values as per the following table.
bookmark · Added on July 9, 2025
· 31 min read
While we are grateful to have had the opportunity to give this presentation, an event in 2025 has resulted in us distancing ourselves from the conference responsible for hosting this talk.
bookmark · Added on July 9, 2025
· 23 min read
I first heard of Strange Loop a few years ago, the talk that introduced me to the conference was Programming Should Eat Itself by Nada Amin.
bookmark · Added on July 5, 2025
· 1h 30m read
Agda is a wonderful language and its unification engines are exemplary, practical, improve over time and work predictably well.
bookmark · Added on July 2, 2025
· 19 min read
Although these fused attention implementations have substantially improved performance and enabled long contexts, this efficiency has come with a loss of flexibility.
bookmark · Added on July 1, 2025
· 8h 6m read
—————SOURCES————————————————————————
Percolation – Béla Bollobás and Oliver Riordan
Cambridge University Press, New York, 2006.
Sixty Years of Percolation – Hugo Duminil-Copin
https://www.ihes.fr/~duminil/publi/2018ICM.pdf
Percolation – Geoffrey Grimmett
volume 321 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin, second edition, 1999.
—————NOTES—————————————————————————
Note at 10:42 – The uniqueness of the infinite cluster is known for the d-dimenional lattice since the works of Aizenman, Kesten and Newman - [Uniqueness of the infinite cluster and continuity of connectivity functions for short and long range percolation (1987)] and Burton and Keane - [Density and uniqueness in percolation (1989)]. It does not hold in general: when the graph in question is a regular tree for example, there are always infinitely many clusters during the supercritical phase.
The two last results shown here are only known for site percolation (in which vertices are open or closed instead of edges) in the triangular lattice, where a scaling limit for the boundaries of critical clusters was proved to exist (more on that in the third note). It is believed that these results are universal, that is, valid in great generality for planar percolation processes near criticality.
The third result is from an appendix by Gábor Pete in the paper [Scaling limits for the threshold window: When does a monotone Boolean function flip its outcome? (2017)] by Ahlberg and Steif. Consider an n by n box, and the event where there exists a left-right crossing of said box. Recall the uniform coupling from the video: intuitively, the result is saying that the point at which this crossing emerges in the uniform coupling is with high probability inside an interval of size n^{-3/4} around 1/2.
The fourth result is saying that the average size of the cluster of the origin (or any other given point) goes to infinity as we let p approach the critical parameter like a specific power of the distance between p and p_c. This power is called a critical exponent. The existence of these exponents was proved by Smirnov and Werner in the paper [Critical exponents for two-dimensional percolation (2001)].
Note at 10:52 – Hugo Duminil-Copin has several major contributions to the study of processes arising in statistical physics, including Bernoulli percolation. Among his works on Ising and Ising-like processes we can cite [Random Currents and Continuity of Ising Model’s Spontaneous Magnetization (2015)] with Aizenman and Sidoravicius and [Sharp phase transition for the random-cluster and Potts models via decision trees (2019)] with Raoufi and Tassion.
Note at 12:38 – In the triangular lattice site percolation, Stanislav Smirnov proved the conformal invariance of crossing probabilities at criticality (see https://www.unige.ch/~smirnov/papers/icmp-final.pdf for an overview), which led to the proof of the existence of scaling limits of exploration curves as Schramm–Loewner evolution processes. See [Critical percolation in the plane (2009)] by Smirnov. This provided a deep understanding of the critical phase in the triangular lattice site percolation, which to this day is not extended to the square lattice.
Note at 17:52 – It is not at all obvious that the probability of being connected to infinity is continuous above criticality. This result can be proved in the d-dimenional hypercubic lattices using the uniqueness of the infinite cluster, and more generally it was proved for transitive graphs (intuitively, graphs in which all vertices look the same) by Häggström, Peres and Schonmann in [Percolation on transitive graphs as a coalescent process: Relentless merging followed by simultaneous uniqueness (1999)].
—————SECTIONS———————————————————————
0:00 Introduction
1:37 Definition – Bernoulli Percolation
5:23 Definition – Uniform Coupling
7:56 Exploration – High-Resolution Square Grid
9:40 Exploration – Questions and Kesten's Theorem
10:58 Exploration – Ising Model
11:54 Exploration – Critical Percolation
12:50 Exploration – Three-Dimensional Cubic Lattice and Beyond
14:13 Proof – Theorem Statement
15:14 Proof – Simplifications
16:29 Proof – Definition of Critical Parameter
18:41 Proof – Critical Parameter is Greater Than Zero
20:44 Proof – Duality Definition
21:56 Proof – Critical Parameter is Less Than One
25:16 Proof – Summary and Idea for Kesten's Theorem
26:11 Conclusion
—————CREDITS————————————————————————
Caio Alves – writing, 3D animation
Aranka Hrušková – writing, clarinet
Vilas Winstein – writing, 2D animation, editing, voice-over
Special thanks to Anisah Awad, Gábor Pete, Jyotsna Sreenivasan, Angie Zavala
This video is an entry in the second Summer of Mathematics Exposition (#SoME2)
The photographs used in this video are licensed under the Creative Commons Attribution-ShareAlike license:
https://creativecommons.org/licenses/by-sa/4.0/deed.en
Uploader: Spectral Collective
Duration: 1612s
Views: 455517
bookmark · Added on July 1, 2025
· 18 min read
Multilevel modeling of effect of small group’s meditation on math errors
bookmark · Added on June 28, 2025
· 51 min read
A visual walkthrough of the AlphaFold3 architecture, with more details and diagrams than you were probably looking for.
bookmark · Added on June 27, 2025
· 18 min read
In May 2022, someone posted to Hacker News Bartosz Ciechanowski's blog post explaining how mechanical watch movements work. Since then, his blog has been my absolute favorite corner of the Internet.
bookmark · Added on June 27, 2025
· 41 min read
Muon from first principles, what makes it different from other optimizers, and why it works so well.
bookmark · Added on June 27, 2025
· 10 min read
Manfred Mohr, Cubic Limit: P-197 (1977)I enjoy shocking people by telling them I don’t use LLMs.This isn’t true, but it’s morally true for the reference clas...
bookmark · Added on June 27, 2025
· 3 min read
I had a bunch of notion pages in which I had written some notes while reading and watching videos on GPUs for CUDA purpose so thought of doing vibe blogging ...
bookmark · Added on June 27, 2025
· 9 min read
CedarDB is a database system that delivers unmatched performance for transactions and analytics, from small writes to handling billions of rows. Built on cutting-edge research to power today’s tools and tomorrow’s challenges.
bookmark · Added on June 27, 2025
· 1 min read
An exploration of the layout models of TeX and Typst.
bookmark · Added on June 27, 2025
· 7 min read
It's not often you find a classic, but I think I've found a new classic for software and computer hardware developers. It's David J.
bookmark · Added on June 27, 2025
· 40 min read
Dissecting various compression algorithms.
bookmark · Added on June 27, 2025
· 24 min read
Introducing Continuous Thought Machines: a new kind of neural network model that unfolds and uses neural dynamics as a powerful representation for thought.
bookmark · Added on June 26, 2025
· 10 min read
Research on predictive processing models has focused largely on two specific algorithmic theories: Predictive Coding for perception and Active Inferen…
bookmark · Added on June 26, 2025
· 17 min read
On the the engineering challenges and lessons learned from building Claude's Research system
bookmark · Added on June 26, 2025
· 1 min read
Welcome to the home page of the 46th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI 2025)!
PLDI is the premier forum in the field of programming languages and programming systems research, covering the areas of design, implementation, theory, applications, and performance.
PLDI 2025 will be held in-person at the Westin Josun Seoul in Seoul, South Korea.
The main PLDI conference will be held Wednesday, 18 June through Friday, 20 June. Workshops and tutorials were held on Monday, 16 June and Tuesday, 17 June.
PLDI 2025 Travel Guide
Nuno Lopes has kindly writte ...
bookmark · Added on June 26, 2025
· 2 min read
There are lot of reasons why you should consider adopting Resonate.
bookmark · Added on June 26, 2025
· 18 min read
Their origins go back to Google in 2006, when they were first evaluating whether they should implement either GPUs, FPGAs, or custom ASICs.
bookmark · Added on June 26, 2025
· 11 min read
A coding philosophy focused on safety, performance, and developer experience.
bookmark · Added on June 26, 2025
· 1 min read
Shared logs offer linearizable total order across storage shards. However, they enforce this order eagerly upon ingestion, leading to high latencies.
bookmark · Added on June 26, 2025
· 6 min read
This clickable map (adapted from Bailis, Davidson, Fekete et al and Viotti & Vukolic) shows the relationships between common consistency models for concurrent systems.
bookmark · Added on June 26, 2025
· 29 min read
bookmark · Added on June 26, 2025
· 26 min read
Dependent type theory is a powerful and expressive language, allowing you to express complex mathematical assertions, write complex hardware and software specifications, and reason about both of these in a natural and uniform way.
bookmark · Added on June 26, 2025
· 1 min read
Convex is the reactive database for app developers. Everything you need to build your full-stack project.
bookmark · Added on June 26, 2025
· 1 min read
Generate a timeline using AI or from scratch. Perfect for project plans, school assignments, fiction writing, legal matters, and much more.
bookmark · Added on June 26, 2025
· 17 min read
By using feature inversion to visualize millions of activations from an image classification network, we create an explorable activation atlas of features the network has learned and what concepts it typically represents.
bookmark · Added on June 26, 2025
· 35 min read
Can agents learn inside of their own dreams?
bookmark · Added on June 12, 2025
· 1h 34m read
Are world models a necessary ingredient for flexible, goal-directed
behaviour, or is model-free learning sufficient? We provide a formal answer to
this question, showing that any agent capable of generalizing to multi-step
goal-directed tasks must have learned a predictive model of its environment. We
show that this model can be extracted from the agent's policy, and that
increasing the agents performance or the complexity of the goals it can achieve
requires learning increasingly accurate world models. This has a number of
consequences: from developing safe and general agents, to bounding agent
capabilities in complex environments, and providing new algorithms for
eliciting world models from agents.
bookmark · Added on June 5, 2025
· 8 min read
The single most undervalued fact of linear algebra: matrices are graphs, and graphs are matrices
bookmark · Added on May 29, 2025
· 32 min read
fleetwood.dev
bookmark · Added on May 29, 2025
· 9 min read
Key architecture innovation behind DeepSeek-V2 and DeepSeek-V3 for faster inference
bookmark · Added on May 16, 2025
· 6 min read
State-of-the-art image diffusion models take tens of seconds to process a single image. This makes video diffusion even more challenging, requiring significant computational resources and high costs.
bookmark · Added on May 16, 2025
· 13 min read
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
bookmark · Added on May 16, 2025
· 11 min read
supaiku dot com § attention is logarithmic, actually § time complexity is a very bad model when working with parallelism. in which i make the case for work-depth analysis instead of time complexity.
bookmark · Added on May 16, 2025
· 14 min read
The US has signed two landmark agreements with the United Arab Emirates and Kingdom of Saudi Arabia (KSA) that that will noticeably shift the balance of power. The deals have economic, geopolitical…
bookmark · Added on May 16, 2025
· 19 min read
Produced while being an affiliate at PIBBSS[1]. The work was done initially with funding from a Lightspeed Grant, and then continued while at PIBBSS.…
bookmark · Added on May 16, 2025
· 11 min read
I want to provide some tips from my experience implementing a paper. I'm going to cover my tips so far from implementing a dramatically scaled-down versio...
bookmark · Added on May 16, 2025
· 9 min read
A reflection on control, burnout, and the strange weight of technical fluency.
bookmark · Added on May 16, 2025
· 4h 59m read
bookmark · Added on May 16, 2025
· 1 min read
MAP-Elites is a method in reinforcement learning to avoid the local optimum of a search space by storing multiple candidate solutions…
bookmark · Added on May 13, 2025
· 53 min read
While there are already excellent posts on scaling, I wanted to share my own understanding and things i've learned from my past few months and hopefully spark some discussion. I hope this post can shed light for anyone navigating the challenges of scaling up neural networks. And there may be mistakes or inaccuracies, so if you want to correct me or would like to discuss further, please feel free to DM me on X or leave a comment.
bookmark · Added on May 6, 2025
· 29 min read
ML blog.
bookmark · Added on May 4, 2025
· 50 min read
As AI systems are used in high-stakes applications, ensuring interpretability
is crucial. Mechanistic Interpretability (MI) aims to reverse-engineer neural
networks by extracting human-understandable algorithms to explain their
behavior. This work examines a key question: for a given behavior, and under
MI's criteria, does a unique explanation exist? Drawing on identifiability in
statistics, where parameters are uniquely inferred under specific assumptions,
we explore the identifiability of MI explanations.
We identify two main MI strategies: (1) "where-then-what," which isolates a
circuit replicating model behavior before interpreting it, and (2)
"what-then-where," which starts with candidate algorithms and searches for
neural activation subspaces implementing them, using causal alignment.
We test both strategies on Boolean functions and small multi-layer
perceptrons, fully enumerating candidate explanations. Our experiments reveal
systematic non-identifiability: multiple circuits can replicate behavior, a
circuit can have multiple interpretations, several algorithms can align with
the network, and one algorithm can align with different subspaces.
Is uniqueness necessary? A pragmatic approach may require only predictive and
manipulability standards. If uniqueness is essential for understanding,
stricter criteria may be needed. We also reference the inner interpretability
framework, which validates explanations through multiple criteria. This work
contributes to defining explanation standards in AI.
bookmark · Added on May 3, 2025
· 1 min read
Despite the widespread adoption of Transformer models for NLP tasks, the expressive power of these models is not well-understood. In this paper, we establish that Transformer models are universal approximators of continuous permutation equivariant sequence-to-sequence functions with compact support, which is quite surprising given the amount of shared parameters in these models. Furthermore, using positional encodings, we circumvent the restriction of permutation equivariance, and show that Transformer models can universally approximate arbitrary continuous sequence-to-sequence functions on a compact domain. Interestingly, our proof techniques clearly highlight the different roles of the self-attention and the feed-forward layers in Transformers. In particular, we prove that fixed width self-attention layers can compute contextual mappings of the input sequences, playing a key role in the universal approximation property of Transformers. Based on this insight from our analysis, we consider other simpler alternatives to self-attention layers and empirically evaluate them.
bookmark · Added on May 3, 2025
· 1 min read
The ultimate guide to training LLM on large GPU Clusters
bookmark · Added on May 3, 2025
· 1h 44m read
bookmark · Added on May 1, 2025
· 10 min read
In this short note, we give an elementary proof of a universal approximation
theorem for neural networks with three hidden layers and increasing,
continuous, bounded activation function. The result is weaker than the best
known results, but the proof is elementary in the sense that no machinery
beyond undergraduate analysis is used.
bookmark · Added on April 27, 2025
· 3 min read
Last week we took an intuitive peek into the First Isomorphism Theorem as one example in our ongoing discussion on quotient groups.
bookmark · Added on April 27, 2025
· 9 min read
Quantum entanglement is, as you know, a phrase that's jam-packed with meaning in physics. But what you might not know is that the linear algebra behind it is quite simple.
bookmark · Added on April 24, 2025
· 50 min read
Large language models (LLMs) are restricted to reason in the “language space”, where they typically express the reasoning process with a chain-of-thought (CoT) to solve a complex reasoning problem.
bookmark · Added on April 24, 2025
· 16 min read
How It's Made: Fancy Sand
bookmark · Added on April 24, 2025
· 5 min read
Across all experience levels.
bookmark · Added on April 24, 2025
· 34 min read
bookmark · Added on April 22, 2025
· 6 min read
Incredibly self-destructive cancelation of Qualcomm's v8 ALA.
bookmark · Added on April 22, 2025
· 13 min read
The optimal amount of self-satisfaction is not zero
bookmark · Added on April 22, 2025
· 2 min read
Gentle step-by-step guide through the abstract and complex universe of Fragment Shaders.
bookmark · Added on April 22, 2025
· 10 min read
Originally from replies to a Twitter thread: https://x.com/TheGingerBill/status/1914389352416993395
This is not a structured argument against FOSS/OSS but my uncommon thoughts on the topic.
I am not sure if I agree [that FOSS/OSS derives from the same thinking process as the ideology of communism], but I understand the sentiment. The fundamental issue is that software is trivially copyable. I have loads of issues with FOSS and OSS1. And part of this “ideology” (as presented in the original post) is naïvety coupled with only first-order thinking and a poor understanding of ownership.
bookmark · Added on April 22, 2025
· 1 min read
On Bloat Rob Pike Brighter Tech Commonwealth Bank September 30, 2024
bookmark · Added on April 22, 2025
· 1 min read
Large language models (LLMs) are restricted to reason in the "language space", where they typically express the reasoning process with a chain-of-thought (CoT) to solve a complex reasoning problem. However, we argue that language space may not always be optimal for reasoning. For example, most word tokens are primarily for textual coherence and not essential for reasoning, while some critical tokens require complex planning and pose huge challenges to LLMs. To explore the potential of LLM reasoning in an unrestricted latent space instead of using natural language, we introduce a new paradigm Coconut (Chain of Continuous Thought). We utilize the last hidden state of the LLM as a representation of the reasoning state (termed "continuous thought"). Rather than decoding this into a word token, we feed it back to the LLM as the subsequent input embedding directly in the continuous space. Experiments show that Coconut can effectively augment the LLM on several reasoning tasks. This novel latent reasoning paradigm leads to emergent advanced reasoning patterns: the continuous thought can encode multiple alternative next reasoning steps, allowing the model to perform a breadth-first search (BFS) to solve the problem, rather than prematurely committing to a single deterministic path like CoT. Coconut outperforms CoT in certain logical reasoning tasks that require substantial backtracking during planning, with fewer thinking tokens during inference. These findings demonstrate the promise of latent reasoning and offer valuable insights for future research.
bookmark · Added on April 22, 2025
· 3h 34m read
Large language models display impressive capabilities. However, for the most part, the mechanisms by which they do so are unknown.
bookmark · Added on April 22, 2025
· 1 min read
We ask whether multilingual language models trained on unbalanced, English-dominated corpora use English as an internal pivot language -- a question of key importance for understanding how language models function and the origins of linguistic bias. Focusing on the Llama-2 family of transformer models, our study uses carefully constructed non-English prompts with a unique correct single-token continuation. From layer to layer, transformers gradually map an input embedding of the final prompt token to an output embedding from which next-token probabilities are computed. Tracking intermediate embeddings through their high-dimensional space reveals three distinct phases, whereby intermediate embeddings (1) start far away from output token embeddings; (2) already allow for decoding a semantically correct next token in the middle layers, but give higher probability to its version in English than in the input language; (3) finally move into an input-language-specific region of the embedding space. We cast these results into a conceptual model where the three phases operate in "input space", "concept space", and "output space", respectively. Crucially, our evidence suggests that the abstract "concept space" lies closer to English than to other languages, which may have important consequences regarding the biases held by multilingual language models.
bookmark · Added on April 22, 2025
· 15 min read
Roughly every two years, the density of transistors that can be fit onto a silicon chip doubles.
bookmark · Added on April 22, 2025
· 31 min read
bookmark · Added on April 22, 2025
· 3 min read
This isn't a new intuition, but a nice new set of results.
bookmark · Added on April 19, 2025
· 5 min read
:metal: TT-NN operator library, and TT-Metalium low level kernel programming model. - tenstorrent/tt-metal
bookmark · Added on April 19, 2025
· 55 min read
bookmark · Added on April 19, 2025
· 14 min read
When I first started programming Metalium. Memory was a
bookmark · Added on April 19, 2025
· 4 min read
The last layer’s hidden state in a transformer is meant only for being decoded into token probabilities. Don’t use it for autoregressive image generation Dont’t use it for looped latent transformers Only use it to produce the next token in a language model It is a compressed representation of the...
bookmark · Added on April 19, 2025
· 36 min read
A two-dimensional Dirac graphene-channel flash memory based on a two-dimensional-enhanced hot-carrier-injection mechanism that supports both electron and hole injection is used to make devices with a subnanosecond program speed.
bookmark · Added on April 19, 2025
· 25h 42m read
bookmark · Added on April 19, 2025
· 4 min read
Language models serve as the cornerstone of modern natural language processing (NLP) applications and open up a new paradigm of having a single general purpose system address a range of downstream tasks.
bookmark · Added on April 19, 2025
· 7 min read
Even though lots of people nowadays advocate for applying functional programming principles to JavaScript, not many of them know the principles of Lambda Cal...
bookmark · Added on April 19, 2025
· 19 min read
Yet it seems to me that the situation right now is that LtU has readers with very different backgrounds, among them many readers who haven't studied PL formally.
bookmark · Added on April 19, 2025
· 6h 46m read
bookmark · Added on April 19, 2025
· 45 min read
Personal site for posts about my interests: the biotech industry, medicine, molecular biology, neuroscience, biorisk, science, consciousness, AI, innovation, decision making, philosophy, games, sci-fi, probability, and forecasting (among other things). I write to learn, mostly about biotech.
bookmark · Added on April 17, 2025
· 17 min read
Astro description
bookmark · Added on April 15, 2025
· 1h 33m read
Stephen Wolfram explores the broader picture of what's going on inside ChatGPT and why it produces meaningful text. Discusses models, training neural nets, embeddings, tokens, transformers, language syntax.
bookmark · Added on April 10, 2025
· 44 min read
In this paper, we describe Apollo, to the best of our knowledge, the world's
first large-scale production deployment of optical circuit switches (OCSes) for
datacenter networking. We will first describe the infrastructure challenges and
use cases that motivated optical switching inside datacenters. We then delve
into the requirements of OCSes for datacenter applications: balancing cost,
port count, switching time, and optical performance, which drive design choices
and implementation details of our internally developed 3D MEMS-based OCS. To
enable the Apollo optical switching layer, we employ circulators to realize
bidirectional links through the OCS, effectively doubling the OCS radix. The
OCS and circulator design choices were critical for meeting network bandwidth,
scale, and cost targets. We review the critical co-design of WDM transceiver
technology for these OCS plus circulator-based bidirectional links and their
corresponding physical impairments, delivered over four generations/speeds of
optical interconnect. Finally, we conclude with thoughts on future directions
in hardware development and associated applications.
bookmark · Added on April 10, 2025
· 1h 5m read
I argue that data becomes temporarily interesting by itself to some
self-improving, but computationally limited, subjective observer once he learns
to predict or compress the data in a better way, thus making it subjectively
simpler and more beautiful. Curiosity is the desire to create or discover more
non-random, non-arbitrary, regular data that is novel and surprising not in the
traditional sense of Boltzmann and Shannon but in the sense that it allows for
compression progress because its regularity was not yet known. This drive
maximizes interestingness, the first derivative of subjective beauty or
compressibility, that is, the steepness of the learning curve. It motivates
exploring infants, pure mathematicians, composers, artists, dancers, comedians,
yourself, and (since 1990) artificial systems.
bookmark · Added on April 10, 2025
· 1 min read
The proliferation of AI-generated content online has fueled concerns over \emph{model collapse}, a degradation in future generative models' performance when trained on synthetic data generated by earlier models. Industry leaders, premier research journals and popular science publications alike have prophesied catastrophic societal consequences stemming from model collapse. In this position piece, we contend this widespread narrative fundamentally misunderstands the scientific evidence. We highlight that research on model collapse actually encompasses eight distinct and at times conflicting definitions of model collapse, and argue that inconsistent terminology within and between papers has hindered building a comprehensive understanding of model collapse. To assess how significantly different interpretations of model collapse threaten future generative models, we posit what we believe are realistic conditions for studying model collapse and then conduct a rigorous assessment of the literature's methodologies through this lens. While we leave room for reasonable disagreement, our analysis of research studies, weighted by how faithfully each study matches real-world conditions, leads us to conclude that certain predicted claims of model collapse rely on assumptions and conditions that poorly match real-world conditions, and in fact several prominent collapse scenarios are readily avoidable. Altogether, this position paper argues that model collapse has been warped from a nuanced multifaceted consideration into an oversimplified threat, and that the evidence suggests specific harms more likely under society's current trajectory have received disproportionately less attention.
bookmark · Added on April 10, 2025
· 2h 37m read
bookmark · Added on April 10, 2025
· 6 min read
bookmark · Added on April 7, 2025
· 1 min read
The RWKV Language Model
bookmark · Added on April 7, 2025
· 11 min read
About nine months ago, I and three friends decided that AI had gotten good enough to monitor large codebases autonomously for security problems. We s…
bookmark · Added on April 6, 2025
· 31 min read
The past few years have witnessed a growth in size and computational
requirements for training and inference with neural networks. Currently, a
common approach to address these requirements is to use a heterogeneous
distributed environment with a mixture of hardware devices such as CPUs and
GPUs. Importantly, the decision of placing parts of the neural models on
devices is often made by human experts based on simple heuristics and
intuitions. In this paper, we propose a method which learns to optimize device
placement for TensorFlow computational graphs. Key to our method is the use of
a sequence-to-sequence model to predict which subsets of operations in a
TensorFlow graph should run on which of the available devices. The execution
time of the predicted placements is then used as the reward signal to optimize
the parameters of the sequence-to-sequence model. Our main result is that on
Inception-V3 for ImageNet classification, and on RNN LSTM, for language
modeling and neural machine translation, our model finds non-trivial device
placements that outperform hand-crafted heuristics and traditional algorithmic
methods.
bookmark · Added on April 5, 2025
· 10 min read
We are building an open future for AI. Own your silicon future. Join us.
bookmark · Added on April 5, 2025
· 1 min read
Metaphorically, you can think of Vision Transformers as the eyes of the system, able to understand and contextualize what it sees, while Stable Diffusion is the hand of the system, able to generate and manipulate images based on this understanding.
bookmark · Added on April 1, 2025
· 34 min read
Haskell is the world’s best programming language, but let’s face the harsh reality that a lot of times in life you’ll have to write in other programming languages. But alas you have been fully Haskell-brained and lost all ability to program unless it is type-directed, you don’t even know how to start writing a program without imagining its shape as a type first. Well, fear not. The foundational theory behind Algebraic Data Types and Generalized Algebraic Data Types (ADTs and GADTs) are so fundamental that they’ll fit (somewhat) seamlessly into whatever language you’re forced to write. After all, if they can fit profunctor optics in Microsoft’s Java code, the sky’s the limit! This is an “April Fools” joke in the tradition of my previous one in some of these ways that we are going to twist these other languages might seem unconventional or possibly ill-advised… but also the title is definitely a lie: these languages definitely should have them! :D
bookmark · Added on March 29, 2025
· 1 min read
Accelerate is a language for array-based computations, designed to exploit massive parallelism.
bookmark · Added on March 29, 2025
· 1 min read
Rust is safe. Rust is fast. Rust is powerful. And Rust is… sometimes completely unreadable.
bookmark · Added on March 29, 2025
· 3h 43m read
Deep learning models produce their outputs using a series of transformations distributed across many computational units (artificial “neurons”).
bookmark · Added on March 29, 2025
· 5 min read
Things that go wrong with disk IO
bookmark · Added on March 29, 2025
· 1 min read
GPUs are the most popular platform for accelerating HPC workloads, such as artificial intelligence and science simulations. However, most microarchitectural research in academia relies on GPU core pipeline designs based on architectures that are more than 15 years old.
This paper reverse engineers modern NVIDIA GPU cores, unveiling many key aspects of its design and explaining how GPUs leverage hardware-compiler techniques where the compiler guides hardware during execution. In particular, it reveals how the issue logic works including the policy of the issue scheduler, the structure of the register file and its associated cache, and multiple features of the memory pipeline. Moreover, it analyses how a simple instruction prefetcher based on a stream buffer fits well with modern NVIDIA GPUs and is likely to be used. Furthermore, we investigate the impact of the register file cache and the number of register file read ports on both simulation accuracy and performance.
By modeling all these new discovered microarchitectural details, we achieve 18.24% lower mean absolute percentage error (MAPE) in execution cycles than previous state-of-the-art simulators, resulting in an average of 13.98% MAPE with respect to real hardware (NVIDIA RTX A6000). Also, we demonstrate that this new model stands for other NVIDIA architectures, such as Turing. Finally, we show that the software-based dependence management mechanism included in modern NVIDIA GPUs outperforms a hardware mechanism based on scoreboards in terms of performance and area.
bookmark · Added on March 29, 2025
· 26 min read
:metal: TT-NN operator library, and TT-Metalium low level kernel programming model. - tenstorrent/tt-metal
bookmark · Added on March 29, 2025
· 35 min read
bookmark · Added on March 29, 2025
· 36 min read
bookmark · Added on March 28, 2025
· 6 min read
Growing up as a kid in rural Bavaria, I always …
bookmark · Added on March 28, 2025
· 6 min read
This article assumes that you have already used Yazi and are familiar with most of its features.
bookmark · Added on March 28, 2025
· 46 min read
To support GPU programming, the NVPTX back-end supports a subset of LLVM IR along with a defined set of conventions used to represent GPU programming concepts.
bookmark · Added on March 27, 2025
· 1h 15m read
I've spoken about Jim Keller many times on AnandTech.
bookmark · Added on March 25, 2025
· 10 min read
Notes/Primer on Clang Compiler Frontend: Introduction and Architecture
These are my notes on chapters 1 & 2 of the Clang Compiler Frontend by Ivan Murashko. The book is focused on teaching the fundamentals of LLVM to C++ engineers who are interested in learning about compilers to optimize their daily workflow by enhancing their code quality and overall development process. (I’ve referened this book extensively, and a lot of the snippets here are from this book.
bookmark · Added on March 25, 2025
· 14 min read
I am trying to make a simple microprocessor in verilog as a way to understand verilog and assembly at the same time.
I am not sure if I am implementing what I think of microprocessors well enough ...
bookmark · Added on March 24, 2025
· 1h 45m read
Learning FPGA, yosys, nextpnr, and RISC-V . Contribute to BrunoLevy/learn-fpga development by creating an account on GitHub.
bookmark · Added on March 24, 2025
· 30 min read
I genuinely can’t understand how anybody could look at the mess that’s Rust’s async and think that it was a good design for a language that already had the reputation of being very complicated to write.
bookmark · Added on March 24, 2025
· 13 min read
Calibrated AttentionCalibrated Attention NanoGPTAttention is the magic ingredient of modern neural networks. It is the core of what has launched performant language models into the spotlight starting with GPT, and since then, it has extended its hands across all modalities.There are a number of desirable properties that make attention a first-class building block. Namely: • It handles variable sequence lengths with ease • It allows for a global receptive field without needing to scale parameters
bookmark · Added on March 23, 2025
· 38 min read
I have encountered that there are mainly three types of blogs/videos/tutorials talking about transformers
bookmark · Added on March 22, 2025
· 21 min read
Intuitively Template Haskell provides new language features that allow us to convert back and forth between concrete syntax, i. e.
bookmark · Added on March 18, 2025
· 21 min read
[Twitter thread, Hacker News discussion]
bookmark · Added on March 18, 2025
· 4 min read
The section of the wiki allows anyone to document, explain, post questions, or make comments on the Lua source code. You may link to [1] or paste the code in question.
bookmark · Added on March 18, 2025
· 33 min read
bookmark · Added on March 18, 2025
· 1 min read
First, fun and exciting playtime. Then, intense and strenuous skill development. Finally, developing one’s individual style while pushing the boundaries of the field.
bookmark · Added on March 18, 2025
· 3h 33m read
bookmark · Added on March 18, 2025
· 1h 3m read
bookmark · Added on March 18, 2025
· 6 min read
The origins of set theory can be traced back to a Bohemian priest, Bernhard Bolzano (1781-1848), who was a professor of religion at the University of Prague.
bookmark · Added on March 17, 2025
· 9 min read
Guido van Rossum is the author of Python, an interpreted, interactive object-oriented programming language.
bookmark · Added on March 17, 2025
· 1h 27m read
bookmark · Added on March 17, 2025
· 11 min read
:metal: TT-NN operator library, and TT-Metalium low level kernel programming model. - tenstorrent/tt-metal
bookmark · Added on March 17, 2025
· 1 min read
The Tenstorrent Wormhole n300s PCIe accelerator board is available for purchase, featuring 672 RISC-V cores driving 466 TFLOP/s of FP8 matmul.
bookmark · Added on March 17, 2025
· 1 min read
This presentation delves into the fascinating and sometimes aggravating world of numerical data types, exploring the evolution, strengths, and weaknesses of decimal, fixed point, floating point, and shared exponent formats over the past 70 years.
bookmark · Added on March 17, 2025
· 4 min read
Many asked about collaborations (details are in FAQ). Short answer: unless you're from Meta and willing to work with us in your spare time (20+ hrs/week), or you're an early-year PhD from UCB/NYU/CMU/UW (but application ddl was Jan 10, 2025).
Citation request: I'm delighted to know that multiple
bookmark · Added on March 17, 2025
· 4 min read
I've looked into alternative AI accelerators to continue my saga of running GGML on lower power-consumption hardware. The most promising - and the only one that ever replied to my emails - was Tenstorrent. This post is me deeply thinking about if buying their hardware for development is a good inve ...
bookmark · Added on March 10, 2025
· 11 min read
bookmark · Added on March 9, 2025
· 15 min read
However, there remain a number of concerns about them. One is that it can be quite challenging to understand what a neural network is really doing.
bookmark · Added on March 9, 2025
· 2 min read
Transformers are a type of neural network architecture which is popularly used for text generations, machine translations, etc.
bookmark · Added on March 9, 2025
· 1 min read
Transformer language models (LMs) exhibit behaviors -- from storytelling to code generation -- that appear to require tracking the unobserved state of an evolving world. How do they do so? We study state tracking in LMs trained or fine-tuned to compose permutations (i.e., to compute the order of a set of objects after a sequence of swaps). Despite the simple algebraic structure of this problem, many other tasks (e.g., simulation of finite automata and evaluation of boolean expressions) can be reduced to permutation composition, making it a natural model for state tracking in general. We show that LMs consistently learn one of two state tracking mechanisms for this task. The first closely resembles the "associative scan" construction used in recent theoretical work by Liu et al. (2023) and Merrill et al. (2024). The second uses an easy-to-compute feature (permutation parity) to partially prune the space of outputs, then refines this with an associative scan. The two mechanisms exhibit markedly different robustness properties, and we show how to steer LMs toward one or the other with intermediate training tasks that encourage or suppress the heuristics. Our results demonstrate that transformer LMs, whether pretrained or fine-tuned, can learn to implement efficient and interpretable state tracking mechanisms, and the emergence of these mechanisms can be predicted and controlled.
bookmark · Added on March 9, 2025
· 10 min read
The Transformer architecture introduced in this paper was a major breakthrough in sequence transduction methodologies, particularly within neural machine translation (NMT) and broader natural language processing (NLP).
bookmark · Added on March 7, 2025
· 4 min read
We announce the public release of online educational materials for self-learners of CFD using IPython Notebooks: the CFD Python Class!
bookmark · Added on March 6, 2025
· 11 min read
The following document provides an overview of the TT-MLIR project, with a focus on the technical specifications of an MLIR-based compiler stack. So what exactly is an MLIR-based compiler stack?
bookmark · Added on March 6, 2025
· 1 min read
Multi-Level IR Compiler Framework
bookmark · Added on March 6, 2025
· 11 min read
This paper has a really nice Intro, pay close attention to how they lay out the storyline.
bookmark · Added on March 3, 2025
· 29 min read
When implementations of the Transformer's self-attention layer utilize SRAM
instead of DRAM, they can achieve significant speedups. The Tenstorrent
Grayskull architecture provides a large SRAM, distributed across a grid of
cores. This work presents a fused kernel for Grayskull, that exclusively
utilizes its large SRAM by combining matrix multiplication, attention score
scaling and Softmax operations. Additionally, a dedicated Softmax kernel
utilizing the SRAM and a CPU implementation serving as a baseline are
presented. The Softmax operation consumes most of the runtime in the
computation of attention weights from queries and keys on Grayskull. The
speedup of the dedicated Softmax kernel compared to the CPU implementation is
up to $10 \times$, and the Softmax implementation inside the fused kernel is
approximately $1.8 \times$ faster than the dedicated Softmax kernel. The time
and memory complexity of all implementations is quadratic in sequence length.
Currently, the Grayskull e150 is approximately $30 \times$ cheaper for the
general public than an Nvidia H100 PCIe (a state-of-the-art GPU) and offers
approximately $1.5 \times$ more SRAM.
bookmark · Added on March 1, 2025
· 11 min read
At Sesame, our goal is to achieve “voice presence”—the magical quality that makes spoken interactions feel real, understood, and valued.
bookmark · Added on February 28, 2025
· 19 min read
bookmark · Added on February 27, 2025
· 1 min read
4) The differentiating function.
bookmark · Added on February 26, 2025
· 25 min read
All about how TPUs work, how they're networked together to enable multi-chip training and inference, and how they limit the performance of our favorite algorithms. While this may seem a little dry, it's super important for actually making models efficient.
bookmark · Added on February 25, 2025
· 1 min read
A re-construction of the fundamentals of programming as a small mathematical theory (PRISM) based on elementary set theory. Highlights:
$\bullet$ Zero axioms. No properties are assumed, all are proved (from standard set theory).
$\bullet$ A single concept covers specifications and programs.
$\bullet$ Its definition only involves one relation and one set.
$\bullet$ Everything proceeds from three operations: choice, composition and restriction.
$\bullet$ These techniques suffice to derive the axioms of classic papers on the "laws of programming" as consequences and prove them mechanically.
$\bullet$ The ordinary subset operator suffices to define both the notion of program correctness and the concepts of specialization and refinement.
$\bullet$ From this basis, the theory deduces dozens of theorems characterizing important properties of programs and programming.
$\bullet$ All these theorems have been mechanically verified (using Isabelle/HOL); the proofs are available in a public repository.
This paper is a considerable extension and rewrite of an earlier contribution [arXiv:1507.00723]
bookmark · Added on February 25, 2025
· 5 min read
A company called Tenstorrent design and sell PCIe cards for AI acceleration. At the time of writing, they've recently started shipping their Wormhole n150s and Wormhole n300s cards.
bookmark · Added on February 25, 2025
· 5 min read
An in depth look at Tenstorrent Wormhole, originally posted on corsix.org
bookmark · Added on February 20, 2025
· 1h 1m read
The utilization of programming language (PL) models, pre-trained on
large-scale code corpora, as a means of automating software engineering
processes has demonstrated considerable potential in streamlining various code
generation tasks such as code completion, code translation, and program
synthesis. However, current approaches mainly rely on supervised fine-tuning
objectives borrowed from text generation, neglecting unique sequence-level
characteristics of code, including but not limited to compilability as well as
syntactic and functional correctness. To address this limitation, we propose
PPOCoder, a new framework for code generation that synergistically combines
pre-trained PL models with Proximal Policy Optimization (PPO) which is a widely
used deep reinforcement learning technique. By utilizing non-differentiable
feedback from code execution and structure alignment, PPOCoder seamlessly
integrates external code-specific knowledge into the model optimization
process. It's important to note that PPOCoder is a task-agnostic and
model-agnostic framework that can be used across different code generation
tasks and PLs. Extensive experiments on three code generation tasks demonstrate
the effectiveness of our proposed approach compared to SOTA methods, achieving
significant improvements in compilation success rates and functional
correctness across different PLs.
bookmark · Added on February 19, 2025
· 1h 44m read
bookmark · Added on February 17, 2025
· 23 min read
how deep learning could rewrite the way we encode and decode video
bookmark · Added on February 17, 2025
· 10 min read
On Accepting Toil
bookmark · Added on February 17, 2025
· 1 min read
bookmark · Added on February 15, 2025
· 10 min read
Evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems is a complex and nuanced process, reflecting the sophisticated and multifaceted nature of these systems.
bookmark · Added on February 15, 2025
· 1 min read
Learn about the most pressing challenges in LLM inference, along with some practical solutions.
bookmark · Added on February 15, 2025
· 3 min read
As AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time scaling is emerging. Also known as AI reasoning or long…
bookmark · Added on February 12, 2025
· 1 min read
Yesterday I had lunch with a former Ph.D student of mine, who is now highly successful and tenured at a very good school. I was reminded that, over twenty years ago, I was Graduate Director of Admissions. One of my favorite strategies was to take strong candidates who applied for Masters and also offer them […]
bookmark · Added on January 27, 2025
· 1 min read
The text editor is antirez’s kilo, with some changes.
bookmark · Added on January 27, 2025
· 2h 19m read
The advent of Large Language Models (LLMs) represents a notable breakthrough
in Natural Language Processing (NLP), contributing to substantial progress in
both text comprehension and generation. However, amidst these advancements, it
is noteworthy that LLMs often face a limitation in terms of context length
extrapolation. Understanding and extending the context length for LLMs is
crucial in enhancing their performance across various NLP applications. In this
survey paper, we delve into the multifaceted aspects of exploring why it is
essential, and the potential transformations that superior techniques could
bring to NLP applications. We study the inherent challenges associated with
extending context length and present an organized overview of the existing
strategies employed by researchers. Additionally, we discuss the intricacies of
evaluating context extension techniques and highlight the open challenges that
researchers face in this domain. Furthermore, we explore whether there is a
consensus within the research community regarding evaluation standards and
identify areas where further agreement is needed. This comprehensive survey
aims to serve as a valuable resource for researchers, guiding them through the
nuances of context length extension techniques and fostering discussions on
future advancements in this evolving field.
bookmark · Added on January 25, 2025
· 2 min read
I'm Yasser and I've made it my mission to produce an alternative to LLVM, the current king of compiler backend libraries.
bookmark · Added on January 25, 2025
· 16 min read
Starting with a 192-byte one-liner that implements a Reverse Polish Notation arithmetic compiler, we'll work backward to transform it into readable JavaScript by removing one code golf trick at a time
bookmark · Added on January 25, 2025
· 35 min read
In this paper, we demonstrate that information retrieval can be accomplished
with a single Transformer, in which all information about the corpus is encoded
in the parameters of the model. To this end, we introduce the Differentiable
Search Index (DSI), a new paradigm that learns a text-to-text model that maps
string queries directly to relevant docids; in other words, a DSI model answers
queries directly using only its parameters, dramatically simplifying the whole
retrieval process. We study variations in how documents and their identifiers
are represented, variations in training procedures, and the interplay between
models and corpus sizes. Experiments demonstrate that given appropriate design
choices, DSI significantly outperforms strong baselines such as dual encoder
models. Moreover, DSI demonstrates strong generalization capabilities,
outperforming a BM25 baseline in a zero-shot setup.
bookmark · Added on January 25, 2025
· 13 min read
bookmark · Added on January 25, 2025
· 1 min read
bookmark · Added on January 23, 2025
· 4h 59m read
bookmark · Added on January 22, 2025
· 8 min read
successful modifications since its inception, let alone large-scale validation.
bookmark · Added on January 19, 2025
· 11 min read
Note: this post was written for Lean 3; the latest version, Lean 4, is a very different language.
Turn back the clock to 2009: a confused physics major newly infatuated with math and computer science, I enrolled in MATH 273: Numbers and Proofs at the University of Calgary. This wasn’t my first encounter with mathematical proof; in first-year calculus I’d mastered rote regurgitation of delta-epsilon proofs. Despite writing out several dozen, their meaning never progressed beyond a sort of incantation I can summon to this day (for every \( \epsilon > 0 \) there exists a \( \delta > 0 \) such that…).
bookmark · Added on January 18, 2025
· 9 min read
Artificial Intelligence (AI) is advancing at an unprecedented pace, and the DeepSeek-V3 model is at the forefront of this revolution. As…
bookmark · Added on January 17, 2025
· 1 min read
This is a book about large language models. As indicated by the title, it primarily focuses on foundational concepts rather than comprehensive coverage of all cutting-edge technologies. The book is structured into four main chapters, each exploring a key area: pre-training, generative models, prompting techniques, and alignment methods. It is intended for college students, professionals, and practitioners in natural language processing and related fields, and can serve as a reference for anyone interested in large language models.
bookmark · Added on January 15, 2025
· 1h 17m read
bookmark · Added on January 12, 2025
· 35 min read
bookmark · Added on January 11, 2025
· 53 min read
bookmark · Added on January 11, 2025
· 1h 8m read
bookmark · Added on January 10, 2025
· 8 min read
The links below are to various freely (and legitimately!) available online mathematical resources for those interested in category theory at an elementary/intermediate level. There is supplementary page, introductory readings for philosophers, for reading suggestions for those looking for the most accessible routes into category theory and/or links to philosophical discussions. A gentle introduction? My Category … Category Theory: Lecture Notes and Online Books Read More »
bookmark · Added on January 9, 2025
· 1 min read
A high-performance and high-level purely functional data-parallel array programming language that can execute on the GPU and CPU.
bookmark · Added on January 7, 2025
· 1h 51m read
We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with
671B total parameters with 37B activated for each token. To achieve efficient
inference and cost-effective training, DeepSeek-V3 adopts Multi-head Latent
Attention (MLA) and DeepSeekMoE architectures, which were thoroughly validated
in DeepSeek-V2. Furthermore, DeepSeek-V3 pioneers an auxiliary-loss-free
strategy for load balancing and sets a multi-token prediction training
objective for stronger performance. We pre-train DeepSeek-V3 on 14.8 trillion
diverse and high-quality tokens, followed by Supervised Fine-Tuning and
Reinforcement Learning stages to fully harness its capabilities. Comprehensive
evaluations reveal that DeepSeek-V3 outperforms other open-source models and
achieves performance comparable to leading closed-source models. Despite its
excellent performance, DeepSeek-V3 requires only 2.788M H800 GPU hours for its
full training. In addition, its training process is remarkably stable.
Throughout the entire training process, we did not experience any irrecoverable
loss spikes or perform any rollbacks. The model checkpoints are available at
https://github.com/deepseek-ai/DeepSeek-V3.
bookmark · Added on January 2, 2025
· 52 min read
—————SOURCES————————————————————————
Percolation – Béla Bollobás and Oliver Riordan
Cambridge University Press, New York, 2006.
Sixty Years of Percolation – Hugo Duminil-Copin
https://www.ihes.fr/~duminil/publi/2018ICM.pdf
Percolation – Geoffrey Grimmett
volume 321 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin, second edition, 1999.
—————NOTES—————————————————————————
Note at 10:42 – The uniqueness of the infinite cluster is known for the d-dimenional lattice since the works of Aizenman, Kesten and Newman - [Uniqueness of the infinite cluster and continuity of connectivity functions for short and long range percolation (1987)] and Burton and Keane - [Density and uniqueness in percolation (1989)]. It does not hold in general: when the graph in question is a regular tree for example, there are always infinitely many clusters during the supercritical phase.
The two last results shown here are only known for site percolation (in which vertices are open or closed instead of edges) in the triangular lattice, where a scaling limit for the boundaries of critical clusters was proved to exist (more on that in the third note). It is believed that these results are universal, that is, valid in great generality for planar percolation processes near criticality.
The third result is from an appendix by Gábor Pete in the paper [Scaling limits for the threshold window: When does a monotone Boolean function flip its outcome? (2017)] by Ahlberg and Steif. Consider an n by n box, and the event where there exists a left-right crossing of said box. Recall the uniform coupling from the video: intuitively, the result is saying that the point at which this crossing emerges in the uniform coupling is with high probability inside an interval of size n^{-3/4} around 1/2.
The fourth result is saying that the average size of the cluster of the origin (or any other given point) goes to infinity as we let p approach the critical parameter like a specific power of the distance between p and p_c. This power is called a critical exponent. The existence of these exponents was proved by Smirnov and Werner in the paper [Critical exponents for two-dimensional percolation (2001)].
Note at 10:52 – Hugo Duminil-Copin has several major contributions to the study of processes arising in statistical physics, including Bernoulli percolation. Among his works on Ising and Ising-like processes we can cite [Random Currents and Continuity of Ising Model’s Spontaneous Magnetization (2015)] with Aizenman and Sidoravicius and [Sharp phase transition for the random-cluster and Potts models via decision trees (2019)] with Raoufi and Tassion.
Note at 12:38 – In the triangular lattice site percolation, Stanislav Smirnov proved the conformal invariance of crossing probabilities at criticality (see https://www.unige.ch/~smirnov/papers/icmp-final.pdf for an overview), which led to the proof of the existence of scaling limits of exploration curves as Schramm–Loewner evolution processes. See [Critical percolation in the plane (2009)] by Smirnov. This provided a deep understanding of the critical phase in the triangular lattice site percolation, which to this day is not extended to the square lattice.
Note at 17:52 – It is not at all obvious that the probability of being connected to infinity is continuous above criticality. This result can be proved in the d-dimenional hypercubic lattices using the uniqueness of the infinite cluster, and more generally it was proved for transitive graphs (intuitively, graphs in which all vertices look the same) by Häggström, Peres and Schonmann in [Percolation on transitive graphs as a coalescent process: Relentless merging followed by simultaneous uniqueness (1999)].
—————SECTIONS———————————————————————
0:00 Introduction
1:37 Definition – Bernoulli Percolation
5:23 Definition – Uniform Coupling
7:56 Exploration – High-Resolution Square Grid
9:40 Exploration – Questions and Kesten's Theorem
10:58 Exploration – Ising Model
11:54 Exploration – Critical Percolation
12:50 Exploration – Three-Dimensional Cubic Lattice and Beyond
14:13 Proof – Theorem Statement
15:14 Proof – Simplifications
16:29 Proof – Definition of Critical Parameter
18:41 Proof – Critical Parameter is Greater Than Zero
20:44 Proof – Duality Definition
21:56 Proof – Critical Parameter is Less Than One
25:16 Proof – Summary and Idea for Kesten's Theorem
26:11 Conclusion
—————CREDITS————————————————————————
Caio Alves – writing, 3D animation
Aranka Hrušková – writing, clarinet
Vilas Winstein – writing, 2D animation, editing, voice-over
Special thanks to Anisah Awad, Gábor Pete, Jyotsna Sreenivasan, Angie Zavala
This video is an entry in the second Summer of Mathematics Exposition (#SoME2)
The photographs used in this video are licensed under the Creative Commons Attribution-ShareAlike license:
https://creativecommons.org/licenses/by-sa/4.0/deed.en
Uploader: Spectral Collective
Duration: 1612s
Views: 455517