- 👋 Hi, I’m @DerrickYLJ
 - 👀 I’m interested in ...
 - 🌱 I’m currently learning ...
 - 💞️ I’m looking to collaborate on ...
 - 📫 How to reach me ...
 
Highlights
- Pro
 
Pinned Loading
- 
  TidalDecode
TidalDecode Public[ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
 - 
  flexflow/flexflow-train
flexflow/flexflow-train PublicAutomatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training
 - 
  Blocking_Waived_Estimation
Blocking_Waived_Estimation Public[LCN 2024] solving worst case delay of relatively complicated network architecture with [1] Trajectory Approach; [2] Network Calculus; [3] Compositional Performance Analysis (CPA); and [4] Flow Agg…
Python 2
 - 
  mit-han-lab/TinyChatEngine
mit-han-lab/TinyChatEngine PublicTinyChatEngine: On-Device LLM Inference Library
 - 
  
 - 
  LessIsMore
LessIsMore PublicLess Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning
 
          Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
  If the problem persists, check the GitHub status page or contact support.



