焰影
@kidmartian
Fri, Nov 21, 2025 6:11 AM
Fri, Nov 21, 2025 6:12 AM
下一次讀書會題材:
張文鈿 on Facebook
Testing Binary vs Score Evals on the Latest Models
https://arxiv.org/...
Confidence Improves Self-Consistency in LLMs
焰影
@kidmartian
Fri, Nov 21, 2025 6:16 AM
LLM as a Judge: 用語言模型來評估好壞 · YWC 科技筆記
焰影
@kidmartian
Fri, Nov 21, 2025 6:17 AM
Generative AI 技術交流中心 |... on Facebook
載入新的回覆