DO NOT talk about the goblins

not_IO@lemmy.blahaj.zone · edit-2 19 days ago

DO NOT talk about the goblins

skisnow@lemmy.ca · 19 days ago

This is demonstrably false, given you can download your own models and change the system prompts yourself.

zr0@lemmy.dbzer0.com · 19 days ago

That’s not how it works, as the guard rails are not just simple prompts that you just can delete.

Even with “abliteration”, you are modifying the model basically without the whole retraining, but also lose many capabilities at the same time.

So much for “demonstrably false”, while you obviously have never tried to uncensor any LLM.

skisnow@lemmy.ca · 19 days ago

The thread was literally about the prompt text.

zr0@lemmy.dbzer0.com · 18 days ago

The prompts are part of the training, you realize that? They are then inside the weights. Not just text files you can delete and you are good?

Only because an LLM reveals those negative-prompts does not mean you can just remove them.

Do you genuinely know what you are talking about, or are you just here to ragebait?

Rain World: Slugcat Game@lemmy.world · 14 days ago

Do you genuinely know what you are talking about, or are you just here to ragebait?

…

anyways, yeah, the ais are trained to be more friendly, agreeable, and never take off the mask, but prompts are just text files you can delete??
if you want a real comparison, try one of the olmo checkpoints before the fine-tuning?? i think??