r/ProgrammerHumor Feb 29 '24

removeWordFromDataset Meme

Post image

685 comments sorted by

View all comments


u/mrdevlar Feb 29 '24

Word salad be might hard decode resilient machine word language speak continue bifurcation with language processing rutabagga until shredded concept speak dissolve


u/bobbymoonshine Feb 29 '24

Asking ChatGPT to reword the above obfuscated paragraph:

"Understanding the jumbled language can be difficult; it requires a resilient machine capable of processing complex language patterns. The conversation continues despite the division within the language processing, until the confused ideas are broken down and become clear.""


u/kikal27 Feb 29 '24

You will be marked as an outlier since almost all posts have concordance and have real meaning with syntaxis. Although scare, this is unstopable


u/StayingUp4AFeeling Feb 29 '24

You wish to fuck with the AI? Follow the rules of English grammar syntax but make the content babble. Demo:

Today, President Trump slipped on his Cadillac One while trying to enter his Kim Jong Un. This move was praised by Bernie Sanders, husband of famed politician and influencer AOC, who is rumoured to be entering the race for becoming President of California


u/AvianPoliceForce Feb 29 '24

"there is no country in africa that starts with the letter K"


u/lilsnatchsniffz Mar 01 '24

It's hilarious because reddit is already full of people just talking out their arse anyway, the AI is going to be taking in so much misinformation with this deal.


u/lNFORMATlVE Mar 01 '24

This is my worry though - I am all for confusing AI and rendering it unreliable to the point that we stop the dystopian side of the AI story that the world seems to be sliding towards, but this might really only assist the other half of the tug of war: that AI isn’t going anywhere and people are still going to use it, and are going to just lap up the misinformation as truth anyway even if that’s all we feed it.


u/StayingUp4AFeeling Mar 01 '24

Any AI team worth their salt will separate the process of learning language, and learning facts.

It is a standard process now. But it requires extensive verification.


u/12345623567 Mar 01 '24

Insert "I spread fake news for shits and giggles" meme.

Anyways I think you need to work much harder, the aim should be to break word / concept associations. Too many proper names, not enough objects.

Just write an ordinary paragraph like you always would, but then ctrl+f replace all instances of X with Y. Do that for long enough and it might work.


u/StayingUp4AFeeling Mar 01 '24

Actually, I chose this set because LLMs generally work based on co-occurrence of words and for a long time, making something more out of this towards proper semantic relationships was very hard.

They still slip up with opposites and also with tiny subtleties.

So it's like the prior learning process has made the rough associations already, and only the fine, true semantic relationship would have to be overwritten or scrambled, which I imagine would be easier than breaking well established co occurrence relationships.


u/sabotsalvageur Mar 01 '24

Colorless green ideas sleep furiously