LLaDA实现 #197

MoringLotus · 2026-01-22T05:05:57Z

描述内容：

核心实现 (Core Implementation)
模型架构构建：完成了模型核心前向传播链路的开发，包括 Embedding 层、FFN、Router 以及 Expert 模块的逻辑实现。

算子开发：完成了 BiAttention 算子的自主实现，并已正式提交至 InfiniCore 仓库。

工程适配与集成 (Engineering & Integration)
主要算子集成：完成了核心算子在框架内的集成与推理模型的全链路集成。

PR 提交记录：
InfiniLM (PR #197)：模型主体框架与推理逻辑集成。
InfiniCore (PR #966)：BiAttention 算子实现及针对 64 专家配置的相关专家算子适配工作。

后续计划 (Future Work)
代码迭代优化：基于当前已跑通的框架，后续将进一步对部分算子实现及推理效率进行迭代优化。
精度深度对齐：针对数值稳定性与精度对齐进行持续跟踪与微调。

…private-changes PRIVATE

MoringLotus and others added 28 commits November 25, 2025 12:57

LLaDA config init

4c83d61

Merge branch 'InfiniTensor:main' into main

d8f2814

basic LLaDA framework structure

0b91ebe

Next Work: launch device

9f38f21

1126_scratch

980dbe7

clear enviroment

132f8c0

Weights and Meta

23c53dc

scratch meta and weights

368a16e

Fix Weights Impl

5f36403

Scratch Model && Resource Apply

5fb6f15

A mistake of unsupport data type, try fix

766ba84

Solve Sin Table Question

612fb6b

Before Infer Request

a34238e

233

f67a31d

Fix some point and ref in llada.cpp

8b359c4

before inferbatchllada

726b25a

retry

cd04080

single thread shoule finish inferDevice batch

36ea143

fix a crazy bug, you can see xmake lua will lead to ruin[TODO]

0a7e522

Infer Deice Batch (TO ROPE)

fdaea62

finish attention score

21a58d8

december 17th develop MoE

239f1d7

finish expert weight load

2566803

finish router compute

b3f62fb

topkrouter finish

eb8d0ff

finish router

b795b19

vannila infer

fa57e91

Merge remote-tracking branch 'private-repo/single-thread' into merge-…

b93a4c2

…private-changes PRIVATE

MoringLotus requested a review from a team January 22, 2026 05:05

MoringLotus changed the title ~~LLaDA实现（暂未全部完成）~~ LLaDA实现 Jan 26, 2026

greedy decoding

9d70ac5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LLaDA实现 #197

LLaDA实现 #197

Uh oh!

MoringLotus commented Jan 22, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

LLaDA实现 #197

Are you sure you want to change the base?

LLaDA实现 #197

Uh oh!

Conversation

MoringLotus commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

MoringLotus commented Jan 22, 2026 •

edited

Loading