|
diffusion
|
Add LLaDA 8b Diffusion model (#14771)
|
2025-07-31 19:49:09 +08:00 |
|
embedding
|
tests : update for LLAMA_SET_ROWS=1 (#14961)
|
2025-07-30 15:12:02 +03:00 |
|
eval-callback
|
eval-callback : stop on first NaN (#15320)
|
2025-08-14 22:10:51 +03:00 |
|
training
|
finetune: SGD optimizer, more CLI args (#13873)
|
2025-08-14 12:03:57 +02:00 |