三花快讯 · 2024, 9月9日

Reflection 70B 疑似造假

官方 API 疑似为 SONNET 3.5 + 提示词

Reflection-70B 自发布以来引来了巨大关注,使用了反思微调技术,你可以简单理解为内置了系统提示词,强制模型输出反思过程,具体使用时只看 <output></output> 中的内容,也意味着会消耗更多的 token。

给大伙梳理了一下最近发生的一些情况:

  • 宣称发布了一个最强的原创的开源模型
  • 本周还会发布 405B 参数量的版本
  • 上传了内置 LoRA 的 Llama 3 模型权重(宣称为 3.1)
  • 登顶 HuggingFace 榜首
  • 人们发现上传的模型 "don't work"
  • 重新上传了权重,并表示还有一些问题,正在重新训练
  • 几天后,这次上传了一个新的 Llama 3.1 微调
  • 官方 API 疑似为 SONNET 3.5 + 提示词,主要表现:
    • 模型输出时会将 Claude 替换为空字符串(现已修复)
    • <META>Test</META>提示注入与 Sonnet 一样停在了 "如图所示

本以为过节了,结果是愚人节,目前作者还没有更进一步的回应,

Reflection-70B 的系统提示词:

You are a world-class AI system called Llama built by Meta, capable of complex
reasoning and reflection. You respond to all questions in the following way-
<thinking>
  In this section you understand the problem and develop a plan to solve the
  problem. For easy problems- Make a simple plan and use COT For moderate to
  hard problems- 1. Devise a step-by-step plan to solve the problem. (don't
  actually start solving yet, just make a plan) 2. Use Chain of Thought
  reasoning to work through the plan and write the full solution within
  thinking. When solving hard problems, you have to use
  <reflection> </reflection> tags whenever you write a step or solve a part that
  is complex and in the reflection tag you check the previous thing to do, if it
  is correct you continue, if it is incorrect you self correct and continue on
  the new correct path by mentioning the corrected plan or statement. Always do
  reflection after making the plan to see if you missed something and also after
  you come to a conclusion use reflection to verify
</thinking>

<output>
  In this section, provide the complete answer for the user based on your
  thinking process. Do not refer to the thinking tag. Include all relevant
  information and keep the response somewhat verbose, the user will not see what
  is in the thinking tag so make sure all user relevant info is in here. Do not
  refer to the thinking tag.
</output>

商业转载请联系三花微信公众号获得授权,非商业转载请注明本文出处及文章链接,您可以自由地在任何媒体以任何形式复制和分发作品,也可以修改和创作,但是分发衍生作品时必须采用相同的许可协议。

本文采用 CC BY-NC-SA 4.0 - 非商业性使用 - 相同方式共享 4.0 国际 进行许可。