@JOMusic@lemmy.ml to Technology@lemmy.world • English • 5 months ago

US Bill proposed to jail people who download Deepseek

www.404media.co

  • cross-posted to:
  • technology@beehaw.org
  • nottheonion@lemmy.world
  • usa@lemmy.ml
  • opensource@lemmy.ml
  • politics@lemmy.world
Senator Hawley Proposes Jail Time for People Who Download DeepSeek
According to the language of the proposed bill, people who download AI models from China could face up to 20 years in jail, a million dollar fine, or both.
  • metaStatic
    87•5 months ago

    For Base Model

    git lfs install
    git clone https://huggingface.co/deepseek-ai/DeepSeek-V3-Base

    For Chat Model

    git lfs install
    git clone https://huggingface.co/deepseek-ai/DeepSeek-V3
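
    For anyone scripting this, here is a minimal Python sketch that only builds the two commands for a given repository and does not run them. The helper name `clone_commands` and the `HF_BASE` constant are made up for illustration; the repository IDs are the two linked in this comment.

```python
import shlex

# Assumption: repositories live under this host, as in the links above.
HF_BASE = "https://huggingface.co"


def clone_commands(repo_id: str) -> list[str]:
    """Return the shell commands that would clone a repo using Git LFS.

    Hypothetical helper: it only builds strings and never runs anything.
    """
    url = f"{HF_BASE}/{repo_id}"
    # shlex.quote leaves a plain URL untouched and quotes anything unsafe.
    return ["git lfs install", f"git clone {shlex.quote(url)}"]


if __name__ == "__main__":
    for repo in ("deepseek-ai/DeepSeek-V3-Base", "deepseek-ai/DeepSeek-V3"):
        print("\n".join(clone_commands(repo)))
```

    Printing the commands instead of executing them lets you check the URL before starting a very large download.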

    • @theunknownmuncher@lemmy.world
      English
      54•5 months ago

      this is deepseek-v3. deepseek-r1 is the model that got all the media hype: https://huggingface.co/deepseek-ai/DeepSeek-R1

      • fmstrat
        English
        4•5 months ago

        Yeah, the comment OP needs to edit the links, given how many upvotes that got.

    • @neon_nova@lemmy.dbzer0.com
      English
      9•5 months ago

      Can you elaborate on the differences?

      • @cyd@lemmy.world
        English
        20•5 months ago

        Base models are general purpose language models, mainly useful for AI researchers and people who want to build on top of them.

        Instruct or chat models are chatbots. They are made by fine-tuning base models.

        The V3 models linked by OP are DeepSeek’s non-reasoning models, similar to Claude or GPT-4o. These are the “normal” chatbots that reply with whatever comes to their mind. DeepSeek also has a reasoning model, R1. Such models take time to “think” before supplying their final answer; they tend to perform better on things like math problems, at the cost of taking longer to answer.

        It should be mentioned that you probably won’t be able to run these models yourself unless you have a data-center-style rig with 4-5 GPUs. The DeepSeek V3 and R1 models are chonky beasts. There are smaller “distilled” forms of R1 that can be run locally, though.

        • @DogWater@lemmy.world
          English
          5•5 months ago

          I heard people saying they could run the R1 32B model on moderate gaming hardware, albeit slowly.

          • @FrederikNJS@lemm.ee
            English
            5•5 months ago

            32b is still distilled. The full one is 671b.
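
            A rough way to see the gap between those two sizes is to estimate memory for the weights alone: parameter count times bytes per parameter. The sketch below is back-of-the-envelope only. It assumes "32b" and "671b" mean 32 and 671 billion parameters, the bit widths are illustrative choices, and it ignores the KV cache, activations, and runtime overhead. `weight_memory_gb` is a made-up helper name.

```python
def weight_memory_gb(params_billions: float, bits_per_param: int) -> float:
    """Approximate memory for model weights alone, in GB (1 GB = 1e9 bytes).

    Assumption: every parameter is stored at the same bit width; the KV
    cache, activations, and framework overhead are not counted.
    """
    bytes_total = params_billions * 1e9 * bits_per_param / 8
    return bytes_total / 1e9


if __name__ == "__main__":
    # Sizes taken from this thread; bit widths are illustrative.
    for name, size in (("full (671B)", 671), ("distilled (32B)", 32)):
        for bits in (16, 8, 4):
            gb = weight_memory_gb(size, bits)
            print(f"{name} at {bits}-bit: ~{gb:.0f} GB")
```

            Under these assumptions, the 32B distill needs about 16 GB for weights at 4 bits per parameter, while the full 671B model needs roughly 336 GB at the same precision.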

            • @DogWater@lemmy.world
              English
              2•5 months ago

              I know, but the falloff in performance isn’t supposed to be severe.

              • @FrederikNJS@lemm.ee
                English
                1•5 months ago

                You are correct. And yes that is kinda the whole point of the distilled models.

                • @DogWater@lemmy.world
                  English
                  1•5 months ago

                  I know. Lmao

          • @meliante@lemmy.world
            English
            1•5 months ago

            My Legion Slim 5 14" runs it not too badly.

      • metaStatic
        6•5 months ago

        https://www.deepseekv3.com/en/download

        I was assuming one was pre-trained and one wasn’t, but I don’t think that’s correct and I don’t care enough to investigate further.

          @JOMusic@lemmy.ml (OP)
          English
          17•5 months ago

          Is that website legit? I’ve only ever seen https://www.deepseek.com/

          And I would personally recommend downloading from HuggingFace or Ollama
