Pro to

Cybersecurity@sh.itjust.worksEnglish • 2 days ago

AI agents outperform human teams in hacking competitions

the-decoder.com

10

AI agents outperform human teams in hacking competitions

the-decoder.com

Pro to

Cybersecurity@sh.itjust.worksEnglish • 2 days ago

A recent series of cybersecurity competitions organized by Palisade Research shows that autonomous AI agents can compete directly with human hackers, and sometimes come out ahead.

In two hacker competitions run by Palisade Research, autonomous AI systems matched or outperformed human professionals in demanding security challenges.

In the first contest, four out of seven AI teams scored 19 out of 20 points, ranking among the top five percent of all participants, while in the second competition, the leading AI team reached the top ten percent despite facing structural disadvantages.

According to Palisade Research, these outcomes suggest that the abilities of AI agents in cybersecurity have been underestimated, largely due to shortcomings in earlier evaluation methods.

Chat

@taladar@sh.itjust.works
link
fedilink
English
10•2 days ago

The event’s puzzles were designed so they could be solved locally, making them accessible even to AI models with technical constraints.

Want to bet that those puzzles (or some very similar ones) were part of the training data of some of the agents?
- @redsand@lemmy.dbzer0.com
  link
  fedilink
  English
  7•
  edit-2
  2 days ago
  Train AI on demo
  
  Show AI rip through demo
  
  ???(rip off investors)
  
  Profit

Cybersecurity@sh.itjust.works

!cybersecurity@sh.itjust.works

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !cybersecurity@sh.itjust.works

c/cybersecurity is a community centered on the cybersecurity and information security profession. You can come here to discuss news, post something interesting, or just chat with others.

THE RULES

Instance Rules

Be respectful. Everyone should feel welcome here.
No bigotry - including racism, sexism, ableism, homophobia, transphobia, or xenophobia.
No Ads / Spamming.
No pornography.

Community Rules

Idk, keep it semi-professional?
Nothing illegal. We’re all ethical here.
Rules will be added/redefined as necessary.

If you ask someone to hack your “friends” socials you’re just going to get banned so don’t do that.

Learn about hacking

Pico Capture the flag

Other security-related communities !databreaches@lemmy.zip !netsec@lemmy.world !securitynews@infosec.pub !cybersecurity@infosec.pub !pulse_of_truth@infosec.pub

Notable mention to !cybersecuritymemes@lemmy.world

169 users / day
445 users / week
1.3K users / month
4.34K users / 6 months
7.27K subscribers
2.6K Posts
4.01K Comments
Modlog