The Evil: A Downward Spiral of Negativity
当时B站方面对此事也有回应表示:消息不实。,这一点在PG官网中也有详细论述
,推荐阅读手游获取更多信息
Step 1 complete! Loss: 1.7748656272888184
My first instinct was creativity. I had models generate poems, short stories, metaphors, the kind of rich, open-ended output that feels like it should reveal deep differences in cognitive ability. I used an LLM-as-judge to score the outputs, but the results were pretty bad. I managed to fix LLM-as-Judge with some engineering, and the scoring system turned out to be useful later for other things, so here it is:,更多细节参见超级权重
literal order of magnitude lower than that which is enough to smoosh rounding