Couldn't have happened to a nicer guy

☆ Yσɠƚԋσʂ ☆ · 4 months ago

Couldn't have happened to a nicer guy

@SplashJackson@lemmy.ca · 4 months ago

What’s a deepseek? Sounds like a search engine?

@Karcinogen@discuss.tchncs.de · 4 months ago

Deepseek is a Chinese AI company that released Deepseek R1, a direct competitor to ChatGPT.

☆ Yσɠƚԋσʂ ☆ · 4 months ago

You forgot to mention that it’s open source.

@SplashJackson@lemmy.ca · 4 months ago

Nice! What are they competing for? I’m new to this AI business thing.

@MajorSauce@sh.itjust.works · 4 months ago

So far, they are training models extremely efficiently while having US gatekeeping their GPUs and doing everything they can to slow their progress. Any innovation in having efficient models to operate and train is great for accessibility of the technology and to reduce the environment impacts of this (so far) very wasteful tech.

मुक्त · 4 months ago

Market share, in a speculated market to be in the future.

@corsicanguppy@lemmy.ca · 4 months ago

Deepseek is a Chinese AI company

Oh. So, military, then.

@bamboo@lemm.ee · 4 months ago

You can say the same thing about any US AI company. Of course the local terrorists want in

@optissima@lemmy.ml · 4 months ago

Based on what info?

@Grapho@lemmy.ml · 4 months ago

How nice of the Chinese military to make their weapon open source and release it to the world lmao

@sverit@lemmy.ml · 4 months ago

https://github.com/deepseek-ai/DeepSeek-R1

@uberstar@lemmy.ml · 4 months ago

I tried DeepSeek, and immediately fell in love… My only nitpick is that images have to have text on them, otherwise it complains, but for the price of free, I’m basically just asking for too much. Contemporaries be damned.

@geneva_convenience@lemmy.ml · 4 months ago

The bubble must be repaired. Pump more cash in!

@NastyNative@mander.xyz · 4 months ago

Free…aint nothing free in this world!

@SacredPony@sh.itjust.works · 4 months ago

deleted by creator

Sabre363 · 4 months ago

We doing paid promotions or something on Lemmy now? You sure seem to be pushing this DeepSeek thing pretty hard, op.

☆ Yσɠƚԋσʂ ☆ · 4 months ago

That’s right I’m a huge open source shill.

Sabre363 · 4 months ago

None of this has anything to do with the model being open source or not, plenty of other people have already disputed that claim.

@Grapho@lemmy.ml · 4 months ago

It’s a model that outperforms the other ones in a bunch of areas with a smaller footprint and which was trained for less than a twentieth of the price, and then it was released as open source.

If it were European or US made nobody would deem it suspicious if somebody talked about it all month, but it’s a Chinese breakthrough and god forbid you talk about it for three days

☆ Yσɠƚԋσʂ ☆ · 4 months ago

It has everything to do with the tech being open. You can dispute it all you like, but the fact is that all the code and research behind it is open. Anybody could build a new model from scratch using open data if they wanted to. That’s what matters.

Sabre363 · 4 months ago

I’m commenting on the odd nature of the post and your behavior in the comments, pointing out that it comes across as more a shallow advertisement than a sincere endorsement, that is all. I don’t know enough about DeepSeek to discuss it meaningfully, nor do I have enough evidence to decide upon its open source status.

☆ Yσɠƚԋσʂ ☆ · 4 months ago

I don’t really care what you think bud. Stay in your lane.

Sabre363 · 4 months ago

You might have a far more positive interaction with the community if you learned to listen first before jumping on the defensive

☆ Yσɠƚԋσʂ ☆ · 4 months ago

Pretty much all my interactions with the community here have been positive, aside from a few toxic trolls such as yourself. Maybe take your own advice there champ.

Sem · 4 months ago

Deepseek collects and process all the data you sent to their LLN even from API calls. It is a no-go for most of businesses applications. For example, OpenAI and Anyhropic do not collect or process anyhow data sent via API and there is an opy-ouy button in their settings that allows to avoid processing of the data sent via UI.

@fl42v@lemmy.ml · 4 months ago

You can run 'em locally, tho, if their gh page is to be believed. And this way you can make sure nothing gets even sent to their servers, and not just believe nothing is processed.

☆ Yσɠƚԋσʂ ☆ · 4 months ago

I got it running with ollama locally, works as advertised

https://medium.com/@pedro.aquino.se/how-to-install-and-use-deepseek-r1-a-free-and-privacy-first-alternative-to-openai-save-c838d2e5e04a

☆ Yσɠƚԋσʂ ☆ · 4 months ago

DeepSeek is an open source project that anybody can run, and it’s performant enough that even running the full model is cheap enough for any company to do.

@shawn1122@lemm.ee · edit-2 4 months ago

Since it’s open source is there a way for companies to adjust so it doesn’t intentionally avoid saying anything bad about China?

☆ Yσɠƚԋσʂ ☆ · 4 months ago

Anybody can adjust the weights any way they want.

@Ajen@sh.itjust.works · 4 months ago

That doesn’t mean it’s straightforward, or even possible, to entirely remove the censorship that’s baked into the model.

@Grapho@lemmy.ml · 4 months ago

People saying truisms that confirm their biases about shit they clearly know nothing about? I thought I’d left reddit.

☆ Yσɠƚԋσʂ ☆ · 4 months ago

It doesn’t mean it’s easy, but it is certainly possible if somebody was dedicated enough. At the end of the day you could even use the open source code DeepSeek published and your own training data to train a whole new model with whatever biases you like.

@Ajen@sh.itjust.works · 4 months ago

“It’s possible, you just have to train your own model.”

Which is almost as much work as you would have to do if you were to start from scratch.

☆ Yσɠƚԋσʂ ☆ · 4 months ago

It’s obviously not since the whole reason DeepSeek is interesting is the new mixture of experts algorithm that it introduces. If you don’t understand the subject then maybe spend a bit of time learning about it instead of adding noise to the discussion?

@HappyTimeHarry@lemm.ee · 4 months ago

If it was actually programed that way then yes you could go in and adjust that, but the model itself is not censored that way and has no problem describing all sorts of Chinese tabboo subjects.

@blarth@thelemmy.club · edit-2 4 months ago

It should be repeated: no American corporation is going to let their employees put data into DeepSeek.

Accept this truth. The LLM you can download and run locally is not the same as what you’re getting on their site. If it is, it’s shit, because I’ve been testing r1 in ollama and it’s trash.

☆ Yσɠƚԋσʂ ☆ · 4 months ago

It should be repeated: anybody can run DeepSeek themselves on premise. You have absolutely no clue what you’re talking about. Keep on coping there though, it’s pretty adorable.

@jol@discuss.tchncs.de · 4 months ago

Where do I find this opy ouy button? Sounds tasty