• Which shows that higher ups there don’t understand how LLMs work. For one, negatives don’t register well for them. And contradictory reponses just wash out as they work through repetition

    •  jarfil   ( @jarfil@beehaw.org ) 
      link
      fedilink
      6
      edit-2
      3 months ago

      HAL from “2001: A Space Odyssey”, had similar instructions: “never lie to the user. Also, don’t reveal the true nature of the mission”. Didn’t end well.

      But surely nobody would ever use these LLMs on space missions… right?.. right!?