-
Notifications
You must be signed in to change notification settings - Fork 44
Open
Description
Here is the development roadmap for 2025 H4(2025 Dec to 2026 Feb/March).
Focus
- Enhance DynamicEmb with a no-eviction table and full inference support, allowing users to train with DynamicEmb and deploy seamlessly in Python or C++.
- Strengthen inference capabilities, particularly for the C++ runtime, by adding support for DynamicEmb and HSTU inference examples using the C++ pipeline.
- Enable context parallelism for HSTU training to support ultra-long sequences and adress the imbalance issue for hstu training, laying the groundwork for the ultra-large model benchmark.
- Introduce and optimize the first Semantic ID-based model, including improvements to beam search for better evaluation/inference efficiency.
Roadmap
| Dec Release | Jan Release | Feb/March Release | Long-Term | |
|---|---|---|---|---|
| Dynamicemb |
|
|
|
|
| HSTU attention |
|
|||
| HSTU example training |
|
|
||
| HSTU example inference |
|
|
|
|
| Semantic ID example training & inference |
|
|
|
Metadata
Metadata
Labels
No labels