A new study from the Anthropic Fellows Program reveals a technique to identify, monitor and control character traits in large language models (LLMs). The findings show that models can develop ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results