awful.systems
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
@Fallstar@mander.xyz to Technology@lemmy.worldEnglish • 1 month ago

A weird phrase is plaguing scientific papers – and we traced it back to a glitch in AI training data

theconversation.com

external-link
message-square
19
fedilink
  • cross-posted to:
  • hackernews@lemmy.bestiver.se
319
external-link

A weird phrase is plaguing scientific papers – and we traced it back to a glitch in AI training data

theconversation.com

@Fallstar@mander.xyz to Technology@lemmy.worldEnglish • 1 month ago
message-square
19
fedilink
  • cross-posted to:
  • hackernews@lemmy.bestiver.se
Once errors creep into the AI knowledge base, they can be very hard to get out.
  • @Saleh@feddit.org
    link
    fedilink
    English
    14•
    edit-2
    1 month ago

    Google translate is using the same approach like an LLM.

    https://en.wikipedia.org/wiki/Google_Translate
    https://en.wikipedia.org/wiki/Neural_machine_translation

    So is DeepL

    https://en.wikipedia.org/wiki/DeepL_Translator

    And before they were using neural network approaches they used statistical approaches, which are subject to the same errors as a result of bad training data.

    • @wewbull@feddit.uk
      link
      fedilink
      English
      2•1 month ago

      Check the results though. Google translate is far far better at translation than a generic LLM.

Technology@lemmy.world

!technology@lemmy.world

remote_follow_modal_title

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@lemmy.world

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


  • @L4s@lemmy.world
  • @autotldr@lemmings.world
  • @PipedLinkBot@feddit.rocks
  • @wikibot@lemmy.world
  • 4.89K users / day
  • 9.11K users / week
  • 15.8K users / month
  • 34.9K users / 6 months
  • 70.3K subscribers
  • 13.9K Posts
  • 554K Comments
  • Modlog
  • mods:
  • @L3s@lemmy.world
  • enu
  • Technopagan
  • L4sBot
  • L3s
  • @L4s@hackingne.ws
  • BE: 0.19.3
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org