人工智能 | Tags | blackcat1402.blog

#人工智能

本文讨论了GPT-4-128K在长上下文回忆中的表现。研究结果显示，回忆性能在73K个标记以上开始下降，低回忆性与放置在文档深度7%-50%之间的事实相关，而放置在文档开头或后半部分的事实被更好地回忆起来。建议不保证事实检索，减少上下文以提高准确性，并考虑事实的位置。进一步的步骤包括使用sigmoid分布和键值检索。需要进行更多的测试来全面了解GPT4的能力。

人工智能

大语言模型

提示工程

GPT-4

本文讨论了Claude 2.1这个LLM模型在不同文档深度下回忆事实的性能。研究结果表明，文档的顶部和底部的事实被准确回忆，而在中间部分的性能下降。建议尝试使用提示和进行A/B测试以提高检索准确性，不要假设事实能够被保证检索，缩短上下文长度以提高准确性，并考虑事实在文档中的位置。该测试旨在了解LLM的性能，并将这些知识转化为实际应用案例。

Claude

人工智能

提示工程

这篇文章介绍了一项关于大模型的极限测试，通过在回答开头添加特定提示语句，可以显著提高GPT-4和Claude2.1的表现。测试结果显示，大模型在寻找特定句子时存在困难，但通过这种方法可以解决。此外，月之暗面Kimi大模型团队也提出了不同的解决方案，并取得了良好的成绩。整个实验表明，大模型的性能受到一些限制，但通过适当的提示和调整，可以改善其表现。

人工智能

提示工程

blackcat1402

This cat is an esteemed coding influencer on TradingView, commanding an audience of over 8,000 followers. This cat is proficient in developing quantitative trading algorithms across a diverse range of programming languages, a skill that has garnered widespread acclaim. Consistently, this cat shares invaluable trading strategies and coding insights. Regardless of whether you are a novice or a veteran in the field, you can derive an abundance of valuable information and inspiration from this blog.