DeepSeek releases new image model family

@spaduf@slrpnk.net · 3 months ago

DeepSeek releases new image model family

lnxtx (xe/xem/xyr) · 3 months ago

What happened in 1989?

Jesus · 3 months ago

Phoenixz · 3 months ago

Question: as i understood it so far, this thing is open source and so is the dataset.

With that, why would it still obey Chinese censorship?

@thedarkfly@feddit.nl · 3 months ago

Even though it’s magnitudes lower than comparable models, Deepseek still cost millions to train. Unless someone’s willing to invest this just to retrain it from scratch, you’re left with the alignment of its trainers.

Phoenixz · 2 months ago

Good point.

Is the training set malleable, though? Could you give it some additional rules to basically sidestep this?

@thedarkfly@feddit.nl · 2 months ago

Yeah, I guess you could realign it without retraining the whole thing! Dunno what would be the cost though, sometimes this is done with a cohort of human trainers 😅

Phoenixz · 2 months ago

I feel like we’re talking about a guard dog now…

@Jackinopolis@sh.itjust.works · 3 months ago

It’s baked into the training. It’s not a simple thing to take it out. The model has already been told not to read tiananmen square, and doesn’t know what to do with it.

@surewhynotlem@lemmy.world · 3 months ago

Now I’ll never finish that history assignment…

DeepSeek releases new image model family

DeepSeek releases new image model family

Viral AI company DeepSeek releases new image model family | TechCrunch