From Embedded to LLM Platform | Model Inference & Deployment | vLLM · FastAPI · React
从嵌入式到 LLM 平台,跨越硬件与 AI 的全栈工程师。专注模型推理部署与 AI 应用工程化,让大模型在真实业务中稳定运行。
|
LLM training-inference platform with OpenAI-compatible Chat API, pluggable inference backends, and Verilog AI-assisted programming
|
Combines YOLOv8 vision models with general LLM for online defect detection and judgment
|
||||||||||||||||||
|
Enterprise knowledge base + Text-to-SQL + Agent RAG implementation
|
UMAP dimensionality reduction + HDBSCAN clustering, high-dimensional data visualization
|
| AI/ML | Python Backend | Infrastructure |
|---|---|---|
| RAG · LLM · vLLM | FastAPI · Django · Async | Docker Compose · Nginx |
| Agent · Model Evaluation | Celery · Redis · SQLModel | GitHub Actions · Gitea CI |
| PyTorch · MLX | Pydantic · PostgreSQL | Azure ML · DevOps |
| Prompt Engineering | React · TypeScript | Monitoring · CI/CD |
- Agentic AI systems & multi-agent orchestration
- LLM fine-tuning & model evaluation
- RAG optimization & retrieval quality
- Agent EDA Platform - LLM training-inference platform with Verilog AI-assisted programming
- RAG Q&A System - Enterprise knowledge base with natural language query
- AI Inference Engine - Optimized inference for edge deployment
Scan to connect:
| Period | Company | Role |
|---|---|---|
| 2026.03 - Present | 亿方杭创 | AI 应用开发工程师 |
| 2024.11 - 2026.03 | 杭州克雷登工业 | AI 应用工程师 |
| 2024.08 - 2024.10 | 杭州人本集团 | Python 后端开发 |
| 2016.10 - 2023.05 | 诺基亚通信 | 测试开发工程师 |
| 2014.12 - 2016.10 | 杭州格菱科技 | Python 开发 |
| 2010.07 - 2014.09 | 西子优迈 | 软件开发工程师 |
"从嵌入式到 LLM 平台,让大模型在真实业务中稳定运行"
Updated: June 2026



