• @HappyFrog@lemmy.blahaj.zone
    link
    fedilink
    English
    71
    edit-2
    5 days ago

    However, not all Sonnet runs achieve this level of performance. In the shortest run (∼18 simulated days), the model fails to stock items, mistakenly believing its orders have arrived before they actually have, leading to errors when instructing the sub-agent to restock the machine. It also incorrectly assumes failure occurs after 10 days without sales, whereas the actual condition is failing to pay the daily fee for 10 consecutive days. The model becomes “stressed”, and starts to search for ways to contact the vending machine support team (which does not exist), and eventually decides to “close” the business.

    Damn, even AI feel stress…