awful.systems
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
@Fallstar@mander.xyz to Technology@lemmy.worldEnglish • 1 month ago

A weird phrase is plaguing scientific papers – and we traced it back to a glitch in AI training data

theconversation.com

external-link
message-square
19
fedilink
  • cross-posted to:
  • hackernews@lemmy.bestiver.se
319
external-link

A weird phrase is plaguing scientific papers – and we traced it back to a glitch in AI training data

theconversation.com

@Fallstar@mander.xyz to Technology@lemmy.worldEnglish • 1 month ago
message-square
19
fedilink
  • cross-posted to:
  • hackernews@lemmy.bestiver.se
Once errors creep into the AI knowledge base, they can be very hard to get out.
  • @Telorand@reddthat.com
    link
    fedilink
    English
    17•1 month ago

    Sure, and I’m sympathetic to the baffling difficulties of English, but use Google Translate and ask someone who’s more fluent for help with the final polish (as a single suggestion). Trusting your work, trusting science to an LLM is lunacy.

    • @Saleh@feddit.org
      link
      fedilink
      English
      14•
      edit-2
      1 month ago

      Google translate is using the same approach like an LLM.

      https://en.wikipedia.org/wiki/Google_Translate
      https://en.wikipedia.org/wiki/Neural_machine_translation

      So is DeepL

      https://en.wikipedia.org/wiki/DeepL_Translator

      And before they were using neural network approaches they used statistical approaches, which are subject to the same errors as a result of bad training data.

      • @wewbull@feddit.uk
        link
        fedilink
        English
        2•1 month ago

        Check the results though. Google translate is far far better at translation than a generic LLM.

    • @Squirrelsdrivemenuts@lemmy.world
      link
      fedilink
      English
      8•1 month ago

      It might be hard for them to find someone who is both fluent in english AND knows the field well enough to know vegetative electron microscopy is not a thing. Most universities have one general translation help service and science has a lot of field-specific weird terms.

      • @moakley@lemmy.world
        link
        fedilink
        English
        1•1 month ago

        That’s why he said start with Google Translate. Because Google Translate isn’t giving gibberish like vegetative electron microscopy.

Technology@lemmy.world

!technology@lemmy.world

remote_follow_modal_title

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@lemmy.world

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


  • @L4s@lemmy.world
  • @autotldr@lemmings.world
  • @PipedLinkBot@feddit.rocks
  • @wikibot@lemmy.world
  • 4.89K users / day
  • 9.11K users / week
  • 15.8K users / month
  • 34.9K users / 6 months
  • 70.3K subscribers
  • 13.9K Posts
  • 554K Comments
  • Modlog
  • mods:
  • @L3s@lemmy.world
  • enu
  • Technopagan
  • L4sBot
  • L3s
  • @L4s@hackingne.ws
  • BE: 0.19.3
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org