Kazuki Fujii

kazukifujii

https://okoge-kaz.github.io/

AI & ML interests

Distributed Training, ML Systems, VLA

Recent Activity

upvoted an article 6 days ago

Unlocking asynchronicity in continuous batching

upvoted an article 6 days ago

KV Cache from scratch in nanoVLM

upvoted an article 6 days ago

Continuous batching from first principles

View all activity

Organizations

upvoted 3 articles 6 days ago

Article

Unlocking asynchronicity in continuous batching

ror, pcuenq, ariG23498

•

24 days ago

• 58

Article

KV Cache from scratch in nanoVLM

ariG23498, kashif, lusxvr, andito, pcuenq

•

Jun 4, 2025

• 120

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 402

upvoted an article 7 days ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain

•

Jan 30, 2025

• 343

upvoted an article 8 days ago

Article

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

ariG23498, sayakpaul, sergiopaniego, ror, pcuenq

•

9 days ago

• 83

upvoted 2 papers 14 days ago

Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 58

Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

Paper • 2605.22791 • Published 17 days ago • 31

updated a Space 15 days ago

README

🌍

liked a model 17 days ago

RedHatAI/gemma-4-31B-it-speculator.dflash

4B • Updated 11 days ago • 2.11k • 38

upvoted a paper 26 days ago

MolmoAct2: Action Reasoning Models for Real-world Deployment

Paper • 2605.02881 • Published May 4 • 348

upvoted 2 articles about 2 months ago

Article

Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem

drmapavone

•

Jan 5

• 26

Article

Introduction to 3D Gaussian Splatting

dylanebert

•

Sep 18, 2023

• 140

upvoted a collection 3 months ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 52 items • Updated about 18 hours ago • 150

upvoted an article 3 months ago

Article

Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds

nvidia

•

Mar 11

• 6

liked a dataset 3 months ago

nvidia/Nemotron-Pretraining-Specialized-v1.1

Viewer • Updated Mar 11 • 19.8M • 2.73k • 44

liked a model 3 months ago

nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16

Text Generation • 124B • Updated Apr 29 • 771k • • 378

upvoted an article 3 months ago

Article

LeRobot v0.5.0: Scaling Every Dimension

imstevenpmwork, pepijn223, jadechoghari, CarolinePascal, lilkm, nepyope, Nico-robot, aractingi, VirgileBatto, thomwolf

•

Mar 9

• 43

liked a model 3 months ago

google/paligemma-3b-pt-224

Image-Text-to-Text • 3B • Updated Sep 21, 2024 • 608k • 461

liked 2 datasets 3 months ago

tokyotech-llm/lmsys-chat-1m-synth

Updated Feb 20 • 521 • 21

nvidia/Nemotron-ClimbMix

Viewer • Updated Oct 21, 2025 • 355M • 9.66k • 114

Kazuki Fujii

AI & ML interests

Recent Activity

Organizations

kazukifujii's activity

Unlocking asynchronicity in continuous batching

KV Cache from scratch in nanoVLM

Continuous batching from first principles

KV Caching Explained: Optimizing Transformer Inference Efficiency

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

README

Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem

Introduction to 3D Gaussian Splatting

Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds

LeRobot v0.5.0: Scaling Every Dimension