"Links No Longer Mean Credibility" - did they used to? I mean, I mostly agree with this article but a person could have written this about the internet. I remember people linking to all sorts of random web pages and using that as a source of credibility.
Late last year I tried asking ChatGPT to summarize a collection of 10 researchers' views/findings on a topic and provide representative quotes. It initially looked plausible but when I checked the links, the quotes were from clearly AI generated summaries of actual interviews. The paraphrasing was also plausible but subtly and profoundly incorrect.
I haven't tested this again on the latest models though, so not sure if there's been an improvement.
That's more or less how it works. To actually have the system carry out your intention it would have to use significant hardware resources (and even then who knows if it would actually work). Alternatively you would need to break up the work into chunks that would not overwhelm the hardware the system allocates to you.
A lot of people don't realize this because the work that they are having the AI do does not need to be either true or false. It just has to output media that seems like it fits. The system probably took many shortcuts to keep the resource use low while outputting something plausible but false.
And frankly this is sort of fine as long as you know what it's doing and what the limitations are. Hypothetically if you broke up the task into multiple steps that the system can actually ingest properly it might reduce the time that the task took overall, maybe even significantly, but not down to one prompt.
You always have to check your sources because citation laundering is a thing[0].
In addition, most mainstream[1] journalists cite sources more liberally than a scientist should, so the source might not say what the journalist reports. The Atlantic’s bit on Waymo’s poor detection of minorities[2] is one example.
0: https://wiki.roshangeorge.dev/w/Blog/2026-01-17/Citogenesis
1: Some independent reporters like Matt Yglesias are more rigorous, though their direct reporting can still be bogus
2: https://www.theargumentmag.com/p/no-waymos-arent-racist
People like to blame social media for this kind of bullshit, but social media is just the vector.
Just this week I read a "study" because someone claimed on social media that it was made by (public, famous) Unis A, B and C and reported, as an effect, a 30% increase in revenue for the companies that participated in the experiment.
The "study" was commissioned by an interest group (bad sign). It was conducted by people associated with said unis (I didn't check their credentials), and it did report in its headline the 30% revenue increase.
Said study was about an experiment that ran for a few months. Within these months, the revenue was flat (which could be considered good enough for the cause). The 30% was the revenue of this period against the same period the previous year. So somehow the experiment affected the companies retroactively! Not to mention that the researchers were able to find a group of companies that were, on average, growing 30% YoY. Surprising indeed.
So even if you check your sources, it may still be bullshit science or bullshit reporting from well-credentialed sources.
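To make the arithmetic concrete, here is a rough sketch with made-up numbers (none of them come from the study, and the variable names are mine). It shows how revenue can be flat across the experiment and still produce a 30% headline when the comparison is against the same months a year earlier.

```python
# Hypothetical monthly revenue figures, invented purely for illustration.
same_months_last_year = [100.0, 100.0, 100.0]
months_before_experiment = [130.0, 130.0, 130.0]
months_during_experiment = [130.0, 130.0, 130.0]


def pct_change(before, after):
    """Percentage change between the totals of two periods."""
    return (sum(after) - sum(before)) / sum(before) * 100


# The headline comparison: experiment months vs. the same months last year.
yoy = pct_change(same_months_last_year, months_during_experiment)

# The comparison that isolates the experiment: vs. the months right before it.
vs_baseline = pct_change(months_before_experiment, months_during_experiment)

print(f"Year over year: {yoy:.0f}%")
print(f"Against the months just before the experiment: {vs_baseline:.0f}%")
```

With these assumed figures the growth happened before the experiment started, so only the second comparison says anything about the experiment itself.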
Why not link the study?
When Bard (now known as Gemini) first came out in Europe, I think mid 2023, I tested it out. AI search was still a new thing in those days, and I was excited to see what Google's solution would be like. I had high hopes.
I asked it a question I knew the answer to. It searched the web, and told me the opposite of the truth. (Not nonsense, but a logical inversion of the actual fact. A common failure mode with earlier LLMs.)
Puzzled, I checked the sources. It cited two. Both AI SEO slop.
Bizarrely, I Googled it myself and couldn't even find those pages on Google. Maybe it was using a different search engine? ;)
> Not Sweden, but one Swedish startup.
Just as an aside jumping off this sentence from the article, I am far less tolerant of the practice of naming countries of origin or general locales rather than specific organizations in headlines and stories.
Name the organization, and if you want to in the body, name where they’re from/located/operating as it pertains to the organization. For that matter, if you can offer information on the specific locale (Sweden is a big place after all), you should also do that unless it really is something more national/international.
There are even arguments for doing this in cases where the actual state entity did something.
"The US did X": the president? The Senate? A federal or municipal body? Etc.
But there are arguments against: if "The US bans automatic rifles", then to some extent it's clear what part of the US did it, to some other extent it doesn't matter, and to some other extent the part of the country that did the thing represents the whole country by corporization or democratization.
In history it's very common to say Country did thing: "Germany invaded Poland", "Argentina signed the Roca-Runciman pact" and so on... Possibly because (in addition to the reasons stated above) information needs to be compressed more for the past. We have less space and priority for details of the past than we do for the present, a kind of cold-hot storage mechanism.
Facebook, ever the wasteland of bullshit and scams, has gotten even more bullshit and scammy in the AI era.
I have found the single best way to avoid being pissed off by this shit is to just avoid Facebook. It dramatically cuts down on the amount I am exposed to.
I also run with adblockers, and consume news via brutalist.report, which also helps. (I avoid the Fox News section at the bottom)
Not just Facebook, but also make sure to avoid TikTok, Instagram and YouTube, along with YouTube Shorts. Many of them are just nothing but fake AI content, and these days people are using AI to create fake profiles of good-looking, cute girls doing impossible things or actually showing off their bodies, and so on. At least 50% of what you see on your feed should be considered AI-generated content.
I would say save your time and energy, and invest that into something else - forget all this social media.
My favorite is the 83-year-old AI grandma on YouTube giving retirement advice. They put her in various settings and she looks and sounds very real.
The only obvious tell is the eyes don't track right. But once they fix that, it's really going to be hard to know.
All the comments are about how great her advice is, etc.
Every video has a link to a book she "wrote" on Amazon. I didn't waste my time trying to figure out what the scam is.
I don't do TikTok. My Instagram feed doesn't seem too bad by comparison.
YouTube Shorts also seem OK for me, though the share of AI content is definitely elevated compared to the regular videos recommended to me.
Lastly:
> I would say save your time and energy, and invest that into something else - forget all this social media.
Agree. The promise of social media hasn't worked out. It was nice during the early Netflix streaming days, but has gotten progressively worse since then.
Ironically, 'source checking' is something AI is quite good at.
I mostly disagree with this. You can request sources, you can ask it to check, but no LLM I have used can do this correctly more than 50-75% of the time, and some of the major models are extremely bad at this: giving broken links 90% of the time, incapable of giving actual links rather than search engine links, etc. Constant supervision and repetition of requests can sometimes get results, but it is exhausting. The "sources" it finds are often Reddit posts or other questionable secondary or tertiary sources, not actual original sources.
There's nuance to that. An LLM is quite capable of suggesting relevant reading, given the context. Especially when the context is broad enough that there's enough training data.
"Find me research on code reviews, their size, and quality" would give you more than enough reading. Yet, if you start with a claim, like "Longer PRs mean worse defect detection," the relevant data points become few enough that the AI starts hallucinating.
You get "something, something, PR length, defect detection, IDK, I don't read research papers." Such output is fine as long as the author cares to validate it.
Skip the second step, and you might be good if you ask about something generic, like "What's the Slack story?" or "How did Blockbuster go bust?" Ask about some specific details, though, and you're bound to end up with made-up stuff that sounds just about right, while it's actually wrong.
Checking is different from finding, though. Source checking means just "verify that this information is actually present in that document". Much harder to hallucinate in this case.
A quick smoke check takes just a few minutes.
"Follow each link in this document. Read each link's contents against the contents in this document. Create a report: for each link list a working hyperlink, whether it exists, what claim it supports, whether it supports or fails to support it, and why"
If it returns a report claiming all correct? That's promising, but human verification is important. You've got a list of hyperlinks, and a list of claims; so you can click each with middle-mouse, Ctrl-F 'till you find the point, and close the tab when you do.
If you find any discrepancies? Your initial prompt was malformed and/or you picked the wrong LLM, the wrong human, or possibly all three. Either way, the results are built on quicksand; you'll need to start over.
If no sources are provided? Well now: "If there ain't no sources it never happened."
Compare double-entry bookkeeping. It all needs to add up. If you're 1 cent off, that means something is broken. Likewise, if a single reference is off, it has polluted the context. (This works for human-generated and hybrid documents too. Polluted reasoning is polluted reasoning. The process is what counts.)
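For the mechanical part of that check, a minimal sketch could look like the following. Everything here is hypothetical (the `Citation` type, the function names, the example URLs), and it assumes you already have the text of each linked page. It only catches verbatim quotes that are missing from the source; a paraphrase still needs a human or a second pass.

```python
import re
from dataclasses import dataclass


@dataclass
class Citation:
    url: str          # where the document says the claim comes from
    quote: str        # the exact words attributed to that source
    source_text: str  # text of the linked page, fetched beforehand


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so line wraps don't cause false misses."""
    return re.sub(r"\s+", " ", text).strip().lower()


def check(citations):
    """Return one (url, found) pair per citation."""
    return [
        (c.url, normalize(c.quote) in normalize(c.source_text))
        for c in citations
    ]


def all_add_up(report):
    """Bookkeeping-style rule: a single miss fails the whole document."""
    return all(found for _, found in report)


# Made-up example data; example.invalid is a reserved placeholder domain.
citations = [
    Citation(
        "https://example.invalid/a",
        "revenue was flat",
        "Over the trial,\nrevenue was   flat.",
    ),
    Citation(
        "https://example.invalid/b",
        "scientists have found",
        "An opinion piece with no study in it.",
    ),
]

report = check(citations)
for url, found in report:
    print(url, "OK" if found else "NOT FOUND")
print("Document passes:", all_add_up(report))
```

The all-or-nothing rule in `all_add_up` is the bookkeeping idea: one reference that does not check out is a reason to distrust the rest until it is explained.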
I would love to do it at scale on many online publications, and publish the results. That would teach 'em.
Have we forgotten how bad LLMs were at citing sources when they first came out? So, we had to build a lot of structure (harness engineering) and frontier labs had to do specific training to try to compensate for this.
So, LLMs are inherently bad at citing sources. A lot of effort has been put in to improve this behavior, but it's compensating for an inherent flaw.
I disagree. It is a bullshit machine all the way to the core. LLMs in my world fail to cite full sources and consistently conclude with guesses as facts. It does this much more than an average journalist or reporter would. Only when you double-check it will it then apologize and correct itself.
Judging by the number of scientific papers that have been outed as AI-generated, precisely because they contained hallucinated sources, it's not.
Citation needed, please
Personal experience? You ask it for the name of the paper referenced. You google that paper (for some reason it's not great at going out and acquiring the paper). You then upload the pdf and ask it if the paper supports the assertion if it's not quickly findable via ^F. You go read, ask it clarifying questions about hazard ratios, what they controlled for, etc.
AI is quite good when grounded in a source.
> A clearly AI-generated image didn’t help the credibility (a three-legged crow is quite telling)
Actually I checked some sources, and I found some for three-legged crows:
https://en.wikipedia.org/wiki/Kojiki#The_Nakatsumaki_(%E4%B8...
https://en.wikipedia.org/wiki/Three-legged_crow#/media/File:...
https://en.wikipedia.org/wiki/File:Douze_emblemes_des_rites_...
https://en.wikipedia.org/wiki/File:Chengdu_2007_341.jpg
And by refuting this article, I thereby prove that which it sought to refute.
> Links No Longer Mean Credibility
They never did!
Ultimate credibility? Sure, they never did. Yet the whole thing Google was built upon was using links as tokens of credibility.
You'd assume an outgoing link from a CNN website has more credibility than one from an anonymous blog. That is, I reckon, still true. Although the credibility either link conveys is degrading. Again, it has been so since we started playing the game of SEO, yet AI-generated content in this context is basically a weapon of mass destruction. The deterioration has sped up dramatically.
> Ops, the link doesn’t lead to the study, but to another article. But that article, in turn, has a link of its own. Which leads to yet another article that doesn’t even mention the study anymore.
This is a common, infuriating practice: it provides a veneer of authoritativeness and credibility to newspaper articles, and who is ever going to click on the links that support those very cogent claims? Nobody of course, so they just link to another article with more vague claims, and at every further level deep your willingness to verify that information evaporates at the same rate as the information itself.
But hey, in the meantime the author has managed to sneak in that "scientists have found" and that if you don't believe it you must be anti-science.
Incidentally, highlighting this abuse (together with a bunch of other quality checks and fact-checking) would be a great use of AI on online news publications.
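As a toy illustration of what such a check might measure, here is a sketch that follows a chain of "supporting" links and reports how many hops it takes to reach a primary source, if it gets there at all. The link graph, page names and function are all invented for the example; a real tool would have to extract the links from actual pages.

```python
# Hypothetical link graph: each page maps to the one page it cites as support.
# None means the page cites nothing further.
cites = {
    "news-article": "second-article",
    "second-article": "third-article",
    "third-article": None,
    "direct-report": "the-study",
    "the-study": None,
}
primary_sources = {"the-study"}


def trace(start, cites, primary_sources, max_hops=10):
    """Follow the citation chain and return (hops, reached_primary_source)."""
    page, hops, seen = start, 0, {start}
    while page not in primary_sources:
        nxt = cites.get(page)
        # Stop on a dead end, a loop, or a chain that is too long to trust.
        if nxt is None or nxt in seen or hops >= max_hops:
            return hops, False
        seen.add(nxt)
        page, hops = nxt, hops + 1
    return hops, True


for start in ("news-article", "direct-report"):
    hops, ok = trace(start, cites, primary_sources)
    print(f"{start}: {hops} hop(s), primary source reached: {ok}")
```

In this made-up graph the first chain dies out after two hops without ever touching the study, which is the pattern the quoted passage describes.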
It's amazing that people think Snopes or other "fact-checkers" are reliable sources of information and represent ultimate truth, as if they're immune to bias and don't receive funding from people / organizations with their own agendas.
Snopes (like anywhere) is only as reliable as its track record of collecting firsthand sources and accurately reporting on their contents.
Which is to say: pretty good so far, in their case. For the future? Who knows. But they've done well up to now, at least.
Actually no, their track record is not great: https://en.wikipedia.org/wiki/Snopes#2010s
They are generally quite good, and they provide ample background info for you to replicate (or repudiate) their findings on your own if you're so inclined.
What's amazing is that people think Snopes or other fact-checkers are automatically wrong. I assume this comes from people who make a habit of believing bullshit and can't handle being corrected.
When there is no independent media, it's not difficult to find sources that back up the lies that Snopes and other fact-checkers peddle.
https://fair.org/home/the-digital-media-oligarchy-who-owns-o...
https://swprs.org/the-american-empire-and-its-media/
What lies exactly?
https://www.dailymail.com/news/article-4730092/Snopes-brink-...
https://foodbabe.com/do-you-trust-snopes-you-wont-after-read...
There's a plethora of examples on the internet of Snopes engaging in this type of behavior, if you're actually interested in learning about their problematic approach to their work.
Also relevant: the derision and mockery directed at JD Vance as a “couch fucker” even used by John Oliver.
I read “Hillbilly Elegy” and wondered why it wasn’t in there. Snopes cleared it up in a matter of minutes. Why he hasn’t sued people into oblivion is his prerogative, but it’s a fascinating case study that we are, indeed, living in a Post-Truth environment.
There was a time, in the early to mid 2010s, when the phrase "Fake News" was almost exclusively used by people in publishing to talk about a very real rise in editorial disruption as news readers shifted from being desktop and homepage-driven to mobile and Facebook-driven.
And then, one day, the politicians started saying it...
Oliver in that clip literally calls the couch-fucking thing "the fun kind of misinformation". He's not suggesting it's true.
Did anyone actually believe that was anything more than a joke? It was a disgusting and weird thing to suggest about a disgusting and weird guy, and highly immature, but it's only libel if it's presented as being true.
Interesting that you focus on John Oliver's bit considering that it came up in the context of JD Vance doubling down on the whole "they're eating the cats and dogs" thing.
https://youtu.be/NtRPLCso0Sw?t=14m09s
Makes me believe that you're really not commenting in good faith here.
Tucker Carlson set the precedent when he was sued for libel by Karen McDougal and won because Fox News lawyers successfully argued he wasn't a reporter and no reasonable person would believe he's stating facts.
Unless he's repeating Trump's lies, then 77M people apparently believe it.
You're getting downvotes because the target of this particular lie was a known liar, so people probably feel like it's some sort of poetic justice (or they know it's just in-kind retaliation and are cathartically satisfied by it).
I don't think the right answer to widespread disinformation campaigns is retaliatory disinformation campaigns (even if they're couched – pun not intended – in a just-barely-thin-enough veil of "wink wink we know this is a joke").
The right answer is to create systems and measures that actually limit disinformation.
I’m with you. The net effect actually is something akin to honking one’s horn at a guy who honked at you. You think you’re giving him a taste of his own medicine, but walking by I only see two people honking their horn and I’d ideally prefer not to be around the horn honkers since they’re unpleasant.
Purveyors of post-truth lies don’t turn around and sue people. They just peddle more lies; this is the kind of environment scum like the Vances live for.