The majority of “AI Experts” online that I’ve seen are business majors.
Then a ton of junior/mid software engineers who have use the OpenAI API.
Finally are the very very few technical people who have interacted with models directly, maybe even trained some models. Coded directly against them. And even then I don’t think many of them truly understand what’s going on in there.
Hell, I’ve been training models and using ML directly for a decade and I barely know what’s going on in there. Don’t worry I get the image, just calling out how frighteningly few actually understand it, yet so many swear they know AI super well
And even then I don’t think many of them truly understand what’s going on in there.
That’s just the thing about neural networks: Nobody actually understands what’s going on there. We’ve put an abstraction layer over how we do things that we know we will never be able to pierce.
I’d argue we know exactly what’s going on in there, we just don’t necessarily, know for any particular model why it’s going on in there.
But, more importantly, who is going on in there?
And how is it going in there?
Not bad. How’s it going with you?
That’s what we’re trying to find out! We’re trying to find out who killed him, and where, and with what!

Excellent opportunity for a “that’s what she said” joke.
Ding ding ding.
It all became basically magic, blind trial and error roughly ten years ago, with AlexNet.
After AlexNet, everything became increasingly more and more black box and opaque to even the actual PhD level people crafting and testing these things.
Since then, it has basically been ‘throw all existing information of any kind at the model’ to train it better, and then a bunch of basically slapdash optimization attempts which work for largely ‘i dont know’ reasons.
Meanwhile, we could be pouring even 1% of the money going toward LLMs snd convolutional network derived models… into other paradigms, such as maybe trying to actually emulate real brains and real neuronal networks… but nope, everyone is piling into basically one approach.
Thats not to say research on other paradigms is nonexistent, but it is barely existant in comparison.
Feature Visualization How neural networks build up their understanding of images
This method is definitely a great way to achieve some degree of explainability for images, but it is based on the assumption that nearby pixels will have correllated meanings. When AI is making connections between far-away features, or worse, in a feature space that cannot be readily visualized like images can, it can be very hard to decouple the nonlinear outputs into singular linear features. While AI explainability has come a long way in the last few years, the decision-making processes of AI are so different from human thought that even when it can “show its work” by showing which neurons contributed to the final result, it doesn’t necessarily make any intuitive sense to us.
For example, an image-identification AI might identify subtle lens blur data to determine the brand of camera that took a photograph, and then use that data to make an educated guess about which country the image was taken in. It’s a valid path of reasoning. But it would take a lot of effort for a human analyst to notice that the AI is using this process to slightly improve its chances of getting the image identification correct, and there are millions of such derived features that combine in unexpected ways, some logical and some irrationally overfitting to the training data.
Yeah, I’ve trained a number of models (as part of actual CS research, before all of this LLM bullshit), and while I certainly understand the concepts behind training neural networks, I couldn’t tell you the first thing about what a model I trained is doing. That’s the whole thing about the black box approach.
Also why it’s so absurd when “AI” gurus claim they “fixed” an issue in their model that resulted in output they didn’t want.
No, no you didn’t.
Love this because I completely agree. “We fixed it and it no longer does the bad thing”. Uh no, incorrect, unless you literally went through your entire dataset and stripped out every single occurrence of the thing and retrained it, then no there is no way that you 100% “fixed” it
I mean I don’t know for sure but I think they often just code program logic in to filter for some requests that they do not want.
My evidence for that is that I can trigger some “I cannot help you with that” responses by asking completely normal things that just use the wrong word.
It’s not 100%, and you’re more or less just asking the LLM to behave, and filtering the response through another non-perfect model after that which is trying to decide if it’s malicious or not. It’s not standard coding in that it’s a boolean returned - it’s a probability that what the user asked is appropriate according to another model. If the probability is over a threshold then it rejects.
I once trained an AI in Matlab to spell my name.
I alternate between feeling so dumb because that is all that my model could do and feeling so smart because I actually understand the basics of what is happening with AI.
I made a cat detector using Octave. Just ‘detected’ cats in small monochrome bitmaps, but hey, I felt like Neo for a while!
I made a neural net from scratch with my own neural net library that could identify cats from dogs 60% of the time. Better than a coin flip, baybeee!
I made a neural net from scratch with my own neural net library and trained it on generating the next move in a game of Go, based on thousands of games from an online Go forum.
It never even got close to learning the rules.
In retrospect, “thousands of games” was nowhere near enough training data for such a complex task, and if we had had enough training data, we never could have processed all of it, since all we were using was a ca. 2004 laptop machine with no GPU. So we just really overreached with that project. But still, it was a really pathetic showing.
Edit: I switched from “I” to “we” here because I was working with a classmate, but we did use my code. She did a lot of the heavy lifting in getting the games parsed into a form where the network could train on it, though.
business majors are the worst i swear to god
They are literally what’s causing the fall of our society.
Objectively, per Ed Zitron.
Didn’t you know? Being adept at business immediately makes you an expert in many science and engineering fields!
adept
I think you’re giving them a little too much credit there
My wife is a business major.
I always tell her that the enemy is in my bed.
(I have no clue why she does not think that this is funny. ;))
I have personally told coworkers that if they train a custom GPT, they should put “AI expert” on their resume as it’s more than 99% of people have done - and 99% of those people didn’t do anything more than tricked ChatGPT into doing something naughty once a year ago and now consider themselves “prompt engineers.”
Absolutely agree there
Hell, I’ve been training models and using ML directly for a decade and I barely know what’s going on in there.
Outside of low dimensional toy models, I don’t think we’re capable of understanding what’s happening. Even in academia, work on the ability to reliably understand trained networks is still in its infancy.
NONE of them knows what’s going on inside.
We are right back in the age of alchemy, where people talking latin and greek threw more or less things together to see what happens, all the while claiming to trying to make gold to keep the cash flowing.
The image feels like “Those who know 😀 Those who don’t know 😬”
And the number of us who build these models from scratch, from the ground up, even fewer.
I’ve been selling it even longer than that and I refuse to use the word expert.
This image is clearly of my hands with an elastic band at the back of class two decades ago
Yeah but why am I arguing with them?
Maybe it’s because they were stretching.
It was the same with crypto TBH. It was a neat niche research interest until pyramid schemers with euphemisms for titles got involved.
With crypto, it was largely MLM scammers who started pumping it (futily, for the most part) until Ross Ulrich and the Silk Road leveraged it for black market sales.
Then Bitcoin, specifically, took off as a means of subverting bank regulations on financial transactions. This encouraged more big-ticket speculators to enter the market, leading to the JP Morgan sponsorship of Etherium (NFTs were a big part of this scam).
There’s a whole historical pedigree to each major crypto offering. Solana, for instance, is tied up in Howard Lutnick’s play at crypto through Cantor Fitzgerald.
Interesting.
I guess AI isn’t so dissimilar, with major ‘sects’ having major billionaire/corporate backers, sometimes aiming for specific niches.
Anthropic was rather infamously funded by FTX. Deepseek came from a quant trading (and to my memory, crypto mining) firm, and there’s loose evidence the Chinese govt is ‘helping’ all its firms with data (or that they’re sharing it with each other under the table, somehow). Many say Zuckerberg open-sourced llama to ‘poison the well’ over OpenAI going closed.
Silk Road and other black market vendors existed well before the scams started. You could mail order drugs online when bitcoin was under $1, the first bubble pushed the price to $30 before crashing to sub-$1 again. THEN the scams and market manipulation took off.
Later people forked the project to create new chains in order to run rug pulls and other modern crypto scams.
Silk Road and other black market vendors existed well before the scams started
Silk Road was launched in 2011, the same year of the first big Mt. Gox crypto heist (now largely recognized as an inside job).
Crypto scams are as old as Bitcoin itself.
OK but what actually is this image?
Basic model of a neural net. The post is implying that you’re arguing with bots.
https://en.wikipedia.org/wiki/Neural_network_(machine_learning)
Wouldn’t a bot recognize this though?
A bot might, but this post is pointing out how common it is for people who consider themselves AI experts to not recognize this diagram that is basically part of AI 101
They’re not saying that the bots are asking what the image is, but users (may be bots or not) that sell themselves as AI/ML experts.
they’re just robots Morty!
Would you recognize if someone made a block diagram of your brain?
Illustration of a neural network.
The simplest neural network (simplified). You input a set of properties(first column). Then you weightedly add all of them a number of times(with DIFFERENT weights)(first set of lines). Then you apply a non-linearity to it, e.g. 0 if negative, keep the same otherwise(not shown).
You repeat this with potentially different numbers of outputs any number of times.
Then do this again, but so that your number of outputs is the dimension of your desired output. E.g. 2 if you want the sum of the inputs and their product computed(which is a fun exercise!). You may want to skip the non-linearity here or do something special™
To elaborate: the dots are the simulated neurons, the lines the links between neurons. The pictured neural net has four inputs (on the left) leading to the first layer, where each neuron makes a decision based on the input it recieves and a predefined threshold, and then passes its answer on to the second layer, which then connects to the two outputs on the right
Many player cat’s cradle
Logic.
Wait till you talk to LinkedIn people interested in Quantum Physics

Probably bc they forgot the bias nodes
(/s but really I don’t understand why no one ever includes them in these diagrams)
Same as if you’d ask a crypto bro how a blockchain actually works. All those self proclaimed Data Scientists who were able to use pytorch once successfully by following a tutorial, just don’t want to die.
Glorious.
I’ve never had it well explained why there are (for example , in this case) two intermediary steps, and 6 blobs in each. That much has been a dark art, at least in the “intro to blah blah” blogposts.
Being the devil’s advocate here.
Do you guys really understand all of your tools down to the technical level? People can make good use of AI/LLM without the need of understanding NN, weights and biases. The same way as I make good use of a microwave or a rangefinder without understanding the deep levels of electromagnetic waves and so on. Fun meme tho.
This is like claiming to be working as an electrician and not knowing how electricity works.
Someone who uses AI to code or make images isn’t doing machine learning anymore than a pilot is doing aerospace engineering. And someone claiming to be an aerospace engineer can’t say that they don’t understand fluid dynamics.
If someone is claiming to be in the machine learning field, not recognizing a fundamental technique of machine learning is a dead giveaway that they’re lying. This kind of diagram is used in introductory courses for machine learning, anyone with any competence in the field would know what it was.
Sure but if you make your living with microwaves somehow you should know what a magnetron is/be able to recognize one. You don’t have to know exactly how it works but like… This is fundamental stuff.
This is an extremely basic intro level ML topic. If you cannot even identify a fully connected network (or “MLP”) then you don’t know anything about the subject. You don’t need to know how to hand compute a back prop iteration to know what this is.




















