The replies here arguing we should publish it all are wild in the worst kind of first-order thinking way.
It’s a census: it just asks questions.
If you start publishing and weaponizing the data against people with various attributes, they’ll just lie or not answer. And then you are left with worse than nothing: bad data people try to act on.
The US Government is the entity that weaponizes the data. The most obvious example is the Census Bureau compiling lists of people of Japanese descent to imprison during WWII. That's just the most obvious one that I know of without looking up more.
The real push for this now is to form lists of people to disenfranchise.
and implicitly force them to sell the land they own for less then it's worth, which in combination with setting up very messed up tax related laws in some states (1) which highly benefit you if you bought land longer in the past effectively "killed" a budding, wealthy, land owning Asian community and made sure it can't really regrow in that form.
(1): I think it was mainly California, but don't remember full
You should carefully examine the evidence you have supporting that belief. Start by observing that this is a partisan issue in which the official positions of the two major parties disagree on a factual claim, not merely the policy. A disagreement on policy can (sometimes) be chalked up to a difference in values, even though those do sometimes arise downstream of factual incorrectness. But a disagreement on facts is one with in which someone is right and someone is wrong. (Or, more complicatedly, someone is closer to accurate and someone is cherry-picking.)
If what you believe to be true is in fact true, then you should be able to comfortably go searching for evidence to falsify it and support the alternative, and fail to find such evidence, confident in your assumption that you won't find it. Either way, I hope that you desire to find the correct answer rather than the one that would be convenient for your political position.
> they are doing everything they can to ensure that only the people who are legally allowed to vote are the ones voting
Illegal voting is so rare that almost every time folks go looking for it they come up empty handed. Examples of voter suppression, on the other hand, are trivial to fine. (And both parties do it, particularly around primaries.)
In my state, we’re trying to enact a citizenship-proof requirement which penalizes women who change their name on getting married and those who can’t afford a passport. In effect, a marriage and poll tax. Ironically, this will disenfranchise the MAGA voters who are themselves pushing for it, but I’m not really going to point that out aggressively.
(That said, a legitimate fraction of American politics right now is in convincing the other side’s likely voters that elections are rigged, the oligarchs are in charge, why even bother calling your electeds or voting, eat an ice-cream sundae and talk to your AI girlfriend.)
Turnout in the past few elections was already extraordinarily high. Clearly this isn’t the answer. And neither is protesting, considering that 4 of the 5 largest demonstrations in US history happened in the past 10 years and achieved nothing.
Ranked Choice Voting makes it easier to vote for “less bad” candidates.
RCV also tends to work against polarization, since it rewards candidates who are at least acceptable to a broad swath of the electorate.
It may not be the “answer” for all that ails the American political system, but it would help.
ETA: Unlike many other reforms it's also doable within the constraints of the current constitutional order and is hard for SCOTUS to torpedo (though I suppose I shouldn't underestimate SCOTUS).
Approval Voting would be an easier pill to swallow for most americans. It’s hard to explain “yeah Trump got the most #1 votes but still lost” and easy to explain “this other candidate got the most checkmarks”.
You also need the option to vote for "less-bad" people. Where I am now, my vote doesn't matter, even if it means the "less-bad" people win with no competition (as opposed to where I moved from where things were skewed the opposite way).
So if you're in a heavily red state but you're blue, then vote in the primaries for the more centered Republican. If you're in a heavily blue state but red, do the opposite. Either way this actually helps because more people are centered and we're getting wilder and wilder candidates because there's increased tribalism. They go to the extremes because they get more voters that way. They figured out that the mainstay voters will just end up voting left or right regardless, and that by catering to the extremes it actually pulls the mainstream voters too. (Both Reps and Dems are using this strategy)
Remember, you don't have to vote for the person you actually like.
And keep doing this until we get a sane voting system which can embed actual preference (any of the cardinal systems: i.e. Approval or STAR). This strategy still works with ordinal systems (i.e. Ranked Choice) because a weak spoiler is still really good at splitting the vote (happened in a pretty famous Alaska election).
> Turnout in the past few elections was already extraordinarily high.
In a sense, this in itself is the issue. It's long-term _worse_ to vote for the "lesser of two serious evils". This extreme "long-term pain for short-term gain" attitude is what's gotten the US to where it is. If in 2016 of 2024 even 20% of the dems would've stayed home or voted third party, the DNC's continuous forcing of awful corpocrats with zero charisma would've become completely untenable and Trump would've been limited to one term. Yet instead they were rewarded for it, so you'll see Newsom get the candidacy and presidency in 2028 (if 2028 even happens at this point), and then in 2032 you'll get something like Hegseth or Thiel winning and it's all over.
There is an answer: relentlessly vote, but only for candidates who are actually slightly decent - including third-party - and otherwise stay at home. "Relentlessly" means "at every level", including locally from the very bottom, all the way up.
The whole idea of "third-party voting is a complete waste in the US" is incredibly dumb because a vote for someone who loses isn't a wasted vote. It shows the others that there's a voter there who can be convinced if catered to, if they select a better candidate. The powers that be have done a fantastic job of brainwashing the entire population of the myth that anyone who _doesn't_ go out and vote for either major candidate is a morally bankrupt person, because it directly benefits them.
The reply to this will be "well it's too late for that now!". It's wrong because the alternative doesn't help you one bit. You're just wishing for a miracle, that in 4 years something happens, kicking the can down the road making things worse long term. And that's actually what's got you here.
It's a symptom of the terminal disease which has infected all layers of American society and has gotten it to where it's at: short-termism. Everyone just looks at the next quarter, the next election. China's ascendency is 1:1 tied to doing the exact opposite. Some smartypants will now point "but zero Covid", great you found a potential exception, now look at the other 90% of policy.
Every time I've explained this I've gotten instantly downvoted without a single reply making an argument against it, because it's too painful for people to admit that they've been part of the road to where the US is at. And again, short-termism: rather feel the short-term tiny dopamine hit by slamming that downvote button than thinking about it. Let's see if this happens again.
> The whole idea of "third-party voting is a complete waste in the US" is incredibly dumb because a vote for someone who loses isn't a wasted vote.
Yes, but with a caveat, if you had a strong preference between the top two actually-likely-to-win candidates (assuming the third party wasn't competitive), it's at least not voting the most in your interests for the outcome. Which is why we really need approval voting, so we can actually vote for the candidates we like, rather than needing to "strategically" hold our noses.
But I agree with the rest of it, if none of the candidates represent you, the third-party vote at least allows you to send a signal of "I vote, but you need to make me want to vote for you, and this is what I want".
> Which is why we really need approval voting, so we can actually vote for the candidates we like, rather than needing to "strategically" hold our noses.
Approval voting would not end strategic voting.[1]
> Every time I've explained this I've gotten instantly downvoted without a single reply making an argument against it,
Ok I'll break it down for you.
> If in 2016 of 2024 even 20% of the dems would've stayed home or voted third party
Parties cater to their bases, and putting yourself out there as an unreliable voting bloc is exactly how you get your demands ignored.
> The whole idea of "third-party voting is a complete waste in the US" is incredibly dumb
It's not incredibly dumb, it's simple mathematical reality. This doesn't change unless the first past the post system changes. Why do you think the GOP backs the Green Party?
> 2024 [...] the DNC's continuous forcing of awful corpocrats with zero charisma would've become completely untenable and Trump would've been limited to one term.
You mean the 2024 election cycle where incumbents all around the globe were beaten because the economic situation was strongly anti-incumbent? Are you positing that the US election was somehow a unique outlier and solely down to Harris being the Democrat candidate? Even though a swing of 115k votes would have handed the presidency to Harris instead?
It sounds like you have a particular issue with the 2016 and 2024 elections and I'm wondering if there's something in common that might explain it...
Hillary Clinton won the popular vote. People didn't want Trump in 2016, they wanted her. But he won the electoral college,
He won the popular vote in 2024, but the tight margin in the electoral college suggests a democratically elected Democratic candidate (i.e. one selected by a primary, not one appointed by the sitting president) could have won instead. Other potential candidates were polling better than Harriss. I personally think Gretchen Whitmer could have successfully distanced herself from the Biden administration and defeated Trump.
> Every time I've explained this I've gotten instantly downvoted
Because it's dumb. People don't want to hear dumb ideas, or take the time to try and convince someone that would spend however long it took to type that, apparently multiple times, without realizing it. Throwing away votes will never be the reasonable thing to do. I know you don't want to hear that, because it's too painful for you to admit there's no simple answer.
> The whole idea of "third-party voting is a complete waste in the US" is incredibly dumb because a vote for someone who loses isn't a wasted vote. It shows the others that there's a voter there who can be convinced if catered to, if they select a better candidate.
Tried that in 2000, voting for Nader as a protest vote against Clinton/Gore third way neoliberalism. I did that in a state where the electoral votes for Dems were 100% safe. Still just got blamed for Bush and there was zero self-reflection on the part of the Democratic Party.
...
I would urge everyone to stop fixating on the Presidential vote as the only fight to win and everything being win/lose based on that outcome. If the Congressional Progressive Caucus in the House exceeds 50% of Democrats in the House, then we can start thinking about a world where e.g. AOC might be the speaker of the House rather than Nancy Pelosi.
> It's a symptom of the terminal disease which has infected all layers of American society and has gotten it to where it's at: short-termism. Everyone just looks at the next quarter, the next election.
Yeah, and the Office of the President is 4-8 years and is just more short-termism, along with individualism / cult of personality / CEO-leadership. If you want to make lasting change in the DNC, start by flipping more and more House seats to progressive from neoliberal.
The legislative seats are barely more malleable than the executive ones, and they’re a lot cheaper to buy off. Even with grassroots efforts to elect local candidates and move them up, it takes a perfect storm to actually get someone that’s even modestly different than the empty suits that largely fill those seats already.
I have zero faith in this system to execute anything other than purchased policy agendas, or empower any more than a tiny symbolic collection of people who oppose them… just enough to give the illusion of agency and stop any real organizing. I have no idea what could possibly break this pattern.
The Republicans were successful with the Tea Party in taking over the House and the Presidency, that's a model which I'd argue is really proven to work in our two party system because we all just literally watched it play out in real-time.
Arguing against that, probably comes from a cynical neoliberal perspective where the Democratic Party can't change because the argument assumes that the Democratic Party can't change.
And the alternative is definitely outright fascism and the suspension of Democracy. They've told us what they're planning on doing, just like we knew they wanted to get rid of Roe vs. Wade, we just accepted the lies about it being settled law and a political football.
If you're not willing to vote against that, then you're comfortably middle class and don't think you'll be one of the ones that are going to be hurt.
I've voted against Trump 3 times and threw money behind trying to get Sanders the nomination in 2020 instead of Biden, so when all the horrible stuff has been going down this term I don't have to tie myself in knots with rationalizations about my actions.
You are saying the candidates are forced on us by someone else. But that's just wrong, we choose the candidates. Anybody can run, there is no secret cabal that decides who can run and who can't.
Didn't happen last year when Kamila was selected by the leaders.
But in normal years candidates are successful because of the amount of money they can raise. The more they can raise the more brainwashing ads they can buy. The non so secret cabal is the donor class.
Anyone can run? You must meet requirements on age and how long you have lived in the US. You must pay fees and provide signatures for each state. If doing it through a party you have to meet their rules.
Cost to get on most states ballots at a basic level is a million. You could do it for free if you dont want to appear on any ballots.
The ideal is that anyone can run, but it's not that easy to just start an independent campaign that has a decent chance of winning. Local races are the most realistic "anyone can run" arena, but once you need a lot of travel and logistics in a large region, you either need a lot of your own resources or the support of an existing large political organization.
You do know the former head of the DNC was forced to retire after the leaked emails outed her, and basically all of the top of the DNC, extensively conspiring against Sanders in favor of Clinton? [1] You're right the cabal isn't secret - it's literally the DNC, and who they want to win is who will win, one way or the other. Just reading over that source - it's insane how blatant these people can be:
"In May 2016, MSNBC's Mika Brzezinski accused the DNC of bias against the Sanders campaign and called on Wasserman Schultz to step down. Wasserman Schultz was upset at the negative media coverage of her actions, and she emailed the political director of NBC News, Chuck Todd, that such coverage of her "must stop". Describing the coverage as the "LAST straw", she ordered the DNC's communications director to call MSNBC president Phil Griffin to demand an apology from Brzezinski."
A primary role of political parties has traditionally been to filter candidates. It's literally their raison d'etre
Sanders wasn't even a Democrat. He switched solely to run in the primary. It's neither scandalous nor surprising that the DNC would try to put up barriers between him and the nomination.
If the RNC had done it's job, Trump would never have been allowed into the primary in the first place.
The trend has been going that direction, where low turnout elections favor Ds and high turnout favors Rs. But only kind of... that holds when Trump is on the ballot. Trump seems to activate a segment of low-info, low-propensity voters who stay home when their guy isn't on the ballot. Things will probably get scrambled again once Trump is gone though.
And don't discount protests. It's crucially important to have big public and forceful displays of united opposition. The regime is unlikely to be toppled by protests, but they will weaken it.
That really matters.
In an authoritarian take-over institutions are the front-lines, not the masses. Think colleges, media, industry, courts, legal firms, local governments, etc. The dilemma those institutions will face is to follow rule-of-law or submit to authoritarian corruption. Authoritarians win when those institutions decide it's safer to submit than it is to follow the law. And when institutions (and the people within them) feel like they are twisting in the wind alone and nobody cares, they are more likely to buckle. Protest movements help reinforce the rule-of-law side of that calculation.
(The rise and fall of Orban is a great lesson on all of this)
Definitely not, you may, at best, shift the problems to someone else. Both of "our" political parties are beyond redemption and cannot be reformed (if they were very not terrible in the first place). The only thing that will change outcomes is direct action and I'm including limitless scaling of that including the armed defense of your ideals.
I'm pretty sure they didn't think this through in a comprehensive fashion.
Because making it esy to find all the rich people just seems like a very bad idea given the direction things are going.
When it was broad, the only thing you could do was locate, say, large minority groups. Blacks and latinos for instance. And even that led to problems. I can't imagine what will happen when we can drill down and tease out immigrants from citizens. Gay from straight. Rich from well to do. And so on.
Do you really need the census to find people of specific demographics in 2026? Pretty sure I can go up to anyone in any state and ask where all the Puerto Ricans live and get an answer (in many cases I'm sure I'll get stared at like I'm crazy, but that's still an answer). I know because my parents moved to predominantly Hispanic parts of Florida before fully settling down where we landed, I REALLY doubt they stopped to pull up census data to decide where to find Hispanics / Puerto Ricans in Florida. You can talk to any local of any area and figure out which areas are a specific nationality without census data.
The census data is probably too stale in most cases to really act on in the way people expect. They used it in the 1940s because that was all they really had. But today it would be stupid to use when you already have backdoors into cell and internet service.
Yeah okay fair, I was about to post a knee jerk reaction, but it's well known that the US government can obtain higher quality data by just simply buying it from the public market.
They haven't done a single thing without malicious intent. Go back and find whatever else you've defended in the past, and look at the results instead of the stated reason/goal for doing them. They won't match. They'll be opposites. You'll rationalize or shift blame, of course. But maybe this time, something decent will get through.
Cell tower data, credit bureau integration, social media scraping, palantir, smart home device surveillance, DNA database exploitation, facial recognition networks, tax, payroll, passport, visa, medicare/medicaid, immigrations and customs databases and many more...
The census is a historical relic used to jerrymander congressional seats, and that's about it.
Census data provides a reliable source to build off of, which makes joining between data sets more reliable. A lot of what you're talking about would be partial prints of an identity that have to be joined up with others to give reliable data.
Eg
> Cell tower data
That's just going to get you a subscriber and device ID, unless you're talking about going deep packet inspection and parsing the contents of the packets. You could, but that's a lot of effort to get something the census can hand you for free.
> credit bureau integration
Notoriously unreliable and identities for the purpose of credit get stolen constantly. The easiest way to clean that is against known-good info, like the census.
> social media scraping
Half the profiles are fake, also not reliable data unless you clean it up. Again, census data makes it very easy to cut out profiles that don't match a real person.
> tax, payroll
These are probably fairly reliable, although they usually won't tell you about a person's demographics.
> passport, visa, medicare/medicaid, immigrations and customs databases
There's an enormous part of the population that won't appear in these at all. The huge part of the country that's "working poor" but not poor enough for Medicaid probably aren't traveling internationally. I wouldn't be surprised if half the country doesn't appear in any of these.
The census has value in that it contains a huge depth of information, is tied with your identity, citizens are compelled by law to answer so even the privacy folks have to respond and lying on it is a crime (enforcement is probably non-existent, though).
I'm sure that can all be reconstructed to some level of accuracy given sufficient effort, but that's a lot harder and requires a ton more coordination than "SELECT * FROM census_data WHERE ..."
I have to agree. I'd like Census data to be private, but the cat is out of the bag.
I'm all for keeping all of this data private. But to think it isn't already available is a bit 'head in sand'. Maybe put laws in place for 'general' privacy across all data, before getting too inflamed about Census in particular.
Even at a quick glance this doesn't make any sense. The census is literally how they get the data. Where else would it come from? Drones? Every computer being hacked Michael Bay style?
I don't work in this industry so I don't know their secret sauce, but I would be surprised if census data is not used as a baseline for what they're selling. It doesn't make sense to not want to use it if your next best sources are relying on everyone in the household having an app that sells their location to your network constantly. I see outdated data about me on the public versions of these sites all the time, so I know they don't have omniscience.
> Title 13 provides the following protections to individuals and businesses:
> Private information is never published. It is against the law to disclose or publish any private information that identifies an individual or business such, including names, addresses (including GPS coordinates), Social Security Numbers, and telephone numbers.
> The Census Bureau collects information to produce statistics. Personal information cannot be used against respondents by any government agency or court.
> Census Bureau employees are sworn to protect confidentiality. People sworn to uphold Title 13 are legally required to maintain the confidentiality of your data. Every person with access to your data is sworn for life to protect your information and understands that the penalties for violating this law are applicable for a lifetime.
Violating the law is a serious federal crime. Anyone who violates this law will face severe penalties, including a federal prison sentence of up to five years, a fine of up to $250,000, or both.
I worked for Data company for a year. We absolutely used Census and ACS as baselines and checks. In fact, there was some talk about getting rid of ACS in Congress and we got emails about "EMAIL YOUR CONGRESSIONAL PERSON, DEMAND ACS STAY. Here are talking points."
I am indeed sworn to not reveal lots of data I knocked on doors for. My memory isn't that good, especially compared to the database it went into, anyway.
I hope it's not a baseline for individual records, but my assumption was that the census data would be pretty useful as a baseline for aggregate information, especially when it comes to comparing to private sets they're working with.
Disenfranchise who? Something I find interesting is that Europe and the United States are experiencing the largest migration in human history yet no one talks about this. Taking a step back, looking at it from a systems perspective, the migrations are working as planned. Designed to subvert Democracy. What does it mean to have a vote when you import 10 new voters who disagree with you?
One example is HIAS who runs a fully staffed Darién Gap facility. As soon as there is a friendly administration to mass migration - they just flip a switch and we will have 10 millions of migrants. I don't want to come across as MAGA, because I am a non voter - the Republicans fuel the fire with forever wars. Its really two sides of the same coin. Companies benefit, because when there is no social cohesion workers are less likely to Unionize.
2.) Democracy happens to be destroyed by local far right movements, composed of people who were there for years and did not migrated anywhere.
The extend of foreign destruction is Vance trying to destroy democracy in Europe openly, Putin doing it secretly and Musk openly enciting pogroms. None of them immigrated to EU.
The real question is why anyone answers these questions in the first place? I just wait until a census worker shows up and tell them how many people live at my domicile. It's needed for proper electoral representation and absolutely nothing else.
You first gather the data while people don't know or care. Then you weaponize it later. It happened at least once not long ago in another country, seems not overreaction to be concerned about it
It happened a year ago in this country, with IRS sharing data with ICE (breaking a longstanding policy of keeping taxpayer data private within the government).
If this is a Nazi reference, Census data was used to send people to concentration camps here during the same era. Less awful than death camps, at least.
Imagine the weaponization possibilities when combining the census data with Amazon’s and Meta’s data, and possibly several other datasets readily available to this administration. Whatever is missing from one of them can be inferred or defined from the others. This might already be happening, it can’t be checked. Some (former) dictators would be salivating.
The easy solution is to just reduce the resolution and scope of the data to the degree it is absolutely necessary. The census exists to inform representation decisions. All other concerns are addons. You can have all the data on the county or voting district level and strip data as you increase your resolution, to the point you only keep population number at the neighborhood, block level.
Knowing the racial, ethnic and socioeconomic background of the residents of a single building block is only useful to discriminate against them.
Demographic information is useful for medical, financial, educational, and so many other items.
The current admin doesn’t need it to discriminate, you can just access cameras and license plate readers and target easily that way.
The purpose is to scare people into misstating or obscuring data to reduce total house representation for an area. It’s to win votes, there are much better ways to do all these things than use this data, but effecting the vote with limited impact is a huge money savings.
This might be the point. As long as they think the people who end up under-counted are not people this government would like to have voting power for the House of Representatives.
Extremists or in general any fraction willing to engage in systematic discrimination, harassment, terrorizing or similar love highly detailed non anonymized census data.
Why?
Because it gives them the perfect layout for which areas to harass (areas likely to yield), which to brutalize (areas unlikely to yield or from especially "hated" people), which to best not touch which (areas with too much influence/money or likely to contain hidden sympathizers), which to systematically take apart through other means like building a highway through them (e.g. "hated" communities to strong/connected to brutalize). etc. etc.
All of this has a lot of history weather it's from right extremists like fascists or left extremists(1).
At which point the question is, if the data you collect is that abuseable. Should you even collect it? Is it even really needed?
(1): Like actual left extremists, the a lot of US sources have the habit to label people as left extremists which by EU standards sometimes aren't even left (but centrist) and very far away from extremism...
Any use to identify where government resources are best used, will have people thinking they should have gotten more and would have if they'd answered differently. Ie, that their answers were "weaponized" against them.
I guess the way to optimize is to find an equilibrium between an extreme of specificity and an extreme of vagueness that's still actionable from a high-level policy perspective.
Something about this conversation is fundamentally broken if there's no space to iterate towards optimization and instead it's just swinging between maximalist extremes.
There's a pretty good chance the Elon Musk, plus Russia and China have had more-orless unrestricted access to American's data since the DOGE dismantling of US government. Plus, by intentionally removing security and accountability mechanisms it makes it impossible to accurately determine how bad the damage actually was.
The Harper government actively worked on destroying the efficacy of the Canadian census, to make it more difficult for subsequent governments to make data-driven decisions.
In addition to the obvious goal of making it easier to identify and target homosexuals, trans people, minorities, immigrants, it's quite possible that destroying future governments' ability to make good decisions is one of the objectives of the Republican party. Stop voting for the face-eating leopard party, already. They don't use the litterbox, shit everywhere, and actively try to eat your face.
For all the very clever people pointing out that this is nothing new, I have two responses.
1. Your cell company may track your location, and your credit rating agencies know how many nose hairs you have, but they doesn't always (or even usually) have the deeply personal information you're supposed to put down in a census.
2. Enough of a change in degree is a change in kind. If you disagree, remember that Imperial Russia had the Okhrana and sent over a million Sybiraks - prisoners and exiles - to Siberia, and then the fucking CHEKA and the NKVD and then the (kinder, softer, slightly less outright murderous) KGB went ahead to send 18 million people into the GULAG system, and outright murdered half a million to a million. This was all the same, right? No difference?
You can’t completely trust what people say anyway. There are stated preferences and observed preferences in economics but it applies to other areas of life.
The entity most capable of weaponizing demographic data is the government itself. If people weren’t previously providing false information to the census, I’m skeptical that this change is what will push people over the edge.
Congress passed laws that blocked the federal government from fusing data across departments for this specific reason. the admin decided to ignore those, and a friendly congress is deciding to not act on that.
You really, really don't want a government who can build a unified profile on you in that way.
Isn't the issue here the lack of accountability? Maybe an unpopular opinion, but I don't think its a foregone conclusion that governments are fundamentally corrupt. Ours certainly is and we have a very weak constitution which makes it worse, but that's the US. I think better constitutions are possible, but we have to stop treating it like a sacred document and be practical.
> I don't think its a foregone conclusion that governments are fundamentally corrupt
There's a question of what you mean. Is it, can they be corrupt? have they been corrupt? are they currently corrupt (because of the previous, or incidentally)?
Plato thought Democracy was corrupt and it's the least inherently corrupt system I know of. I would say they are fundamentally corrupt. The best you can do is try to limit it with a document (like the US Constitution) and setting up a multi-branch power structure capable of adversarial action. As you point out, the US does not have that and it's showing.
I "enumerated" for the last census. Trust in my community was already not high* and I had lots of interesting encounters. I really believed the rather invasive data I was collecting with a friendly face would be used and handled responsibly. I feel for the poor souls that'll sign up to go door to door for 2030 now that the firewalls against weaponizing and monetizing all of our sensitive government data has been torn down, and even more for those that will volunteer information that can hurt them.
The comments that this rather expensive endeavour should just be about getting a head count are also amusing to me. The data collected was such an important baseline of common understanding, and this will not be a good thing for its future quality. I've grown very jaded now seeing all the things taken for granted in this country and lost or degraded recently with a whimper.
*: To be fair, they sent me specifically to places that didn't respond, so I was naturally led to believe that everyone in my region hated the government, ignored bizzarrely threatening fliers, or had recently moved and had no knowledge of the inhabitants (if any) during the census period.
Coming from a certain european country, you never know what answer on the census might get you into trouble.
"What is your religious affiliation". Seems perfectly innocuous, but turned out to be retroactively fatal if your answer could be attributed to you by a certain foreign occupier in the 1940s .
What, religious data? Are you serious? That's one of the most critical things they can track about their citizens.
Let's say your town has a lot of pig farmers. The pig farmers are afraid their business is diminishing. So they lobby the local government to put a tax on chicken and beef, to encourage more pork consumption. Which local officials might be inclined to do for economic reasons. But then you collect religious data, and it turns out 50% of the population is Jewish, Muslim, and Hindu. So half the population now has to pay a tax, which is effectively a tax on their religion, because their religious belief says they can't eat pork.
This is a made up example, but the point is that you need to know about your citizens so you can make just laws that respect those citizens (and encourage businesses, job training, etc based on demographics). It's why we have a census.
Say you get your way, and, for fear of Mark Carney rolling the tanks in and taking over North America, the US stops collecting any data on its citizens. How is the IRS supposed to know how much tax to expect from you? How is SNAP supposed to determine your eligibility? How is unemployment supposed to know if you're ripping them off or not? Data privacy is a real concern, but you need PII to run government services effectively. Running a state without collecting PII is like running a hospital without collecting any.
> How is the IRS supposed to know how much tax to expect from you? How is SNAP supposed to determine your eligibility? How is unemployment supposed to know if you're ripping them off or not?
How does knowing your religious affiliation help them with any of this?
I don't know why you understood my comment as saying government shouldn't have any data. I specifically replied to the comment about religion - there's no reason for the government to collect any data about that from individuals. Churches can report how many members they have if they want to. But it shouldn't be a question on the census.
France used to make plenty of lists. We loved lists. Lists are good. Jews lists? Sure, it's maybe useful one day when we want to do something.
Boy were the Germans happy to find these.
The American obsession with asking for people their perceived origins (AAPI, AA, Latino, ...) is more than weird: it's downright dangerous. Don't fucking ask these questions, and never, ever write it down, especially not with names.
Thankfully, now they can just buy it from data brokers and let Palantir target, so that makes life easier for them
It actually does. Religious affinity can absolutely be useful for longer trend studies, and census data is usually of much, much higher quality than other random sample studies.
Isn’t religion, for those who follow it (I don’t), one of if not the most important aspects of their identity and life’s purpose? I love breakfast food, but not that much.
Don’t some religions not get along very well?
Given your criteria, what should be asked? Check the boxes for the physical and mental illnesses you have? What’s your BMI? How much time do you spend online? What percent of your diet is highly processed foods?
Is gender/sex also nonsensical? Is languages spoken also nonsensical?
Pretty sad, in my opinion. In my ideal the state should have visibility into the shape of the people present so that we can make good decisions about our combined organization. I think we’re making a mistake we will come to regret by intentionally damaging our data collection infrastructure.
I think a large amount of the US’s success is the result of good institutions handling granular data. Policies can be adjusted to match outcomes more rapidly than otherwise.
I understand why people decide to diminish all state capacity - they feel that governments are populated by their opponents who will use state capacity against them. But as our relative strength wanes, our ability to overcome these forces of inertia does as well. And then our governments become less capable and eventually life starts getting worse.
We don’t need house-level data immediately (except perhaps in order to place census blocks within their appropriate congressional district etc). But there are aggregation units above which we should be using as good information as we possibly could be.
TFA lays out why things don't work that way. If you erode trust in the privacy of census responses, an awful lot of folks will have to start lying on their census
> Differential privacy makes this trade-off explicit, and thus impossible to ignore.
I think he has it backwards here.
Techniques like differential privacy hide the fact that a trade-off exists, except for a small cadre of experts who live and breathe this stuff.
I don’t know enough to defend this decision, but it strikes me that if there is a real trade-off, not having access to these techniques will force people other than statisticians to confront the trade-off.
If data about the public is so dangerous that we must disguise the results, then perhaps its data we shouldn’t be collecting in the first place.
Whatever you do, there is a level of trust that is assumed when census takes place. The trust that this data is then not identified in a way that could be targeted for scams, frauds, and other such evils. But in NY, house sale records are made public but much to the detriment, many mortgage companies fake a bill for payment.
Differential privacy is absolutely necessary, and the social scientists being unable to reconstruct the data at an individual level is intended. A macroscopic description is rather enough for most purposes, and anything more is asking for a surveillance state.
> But in NY, house sale records are made public but much to the detriment, many mortgage companies fake a bill for payment.
That frankly sounds more like a failure of enforcement, on top of a failure of the construction of the financial system. Here in Germany, it is absolutely not a common thing that mortgages or the banks holding them get sold like a hot potato towards some other sucker, and thus such a letter would cause immediate suspicion.
Ban it from the dataset, add it to the analysis. You can choose your own flavor of noise.
I don't know what the political undertones are here, but at some level you need to have actual ground truth, including "this person/household declined".
Publishing raw data though? That seems like shooting yourself in the foot from a national security perspective, not to mention all the other reasons not to do it.
> Differential privacy makes this trade-off explicit, and thus impossible to ignore. Maybe banning it is a way of pretending that the problem doesn't exist, in the hope that it will go away?
Or it's saying that one of these conflicting goals is more valuable than the other, and so shouldn't be sacrificed for it.
i have such a hard time reconciling stuff like this:
> The census bureau decided to adopt differential privacy for the 2020 Census
and:
> The consequences will be dire for utility or for privacy, and possibly both. It's hard to understate this point: future statistical releases will either be useless compared to past ones, or they will be incredibly unsafe
so we took the census for centuries before this point, and it was “ok.” and for the last census only we added some privacy items. but if we remove just one of those filters, we are in “dire” circumstances? but there were no privacy features before. so we’re actually still much better off than we were for hundreds of years before this.
this makes it feel like an emotional overblown problem
Believe it or not, mathematical techniques and computational power have increased in the past hundreds of years, not to mention the digitization of everything.
Privacy issues that weren’t possible before due to cost are now pennies to exploit. Also keep in mind as it points out people were using census data to drive gerrymandering efforts, so these attacks are real and have been going on for a long time.
> but there were no privacy features before. so we’re actually still much better off than we were for hundreds of years before this.
One notable thing we have today that we didn't have 100 years ago is a computer. Before, you could reasonably assume that recreating individual records wasn't feasible, at least not on a large scale. You can't assume that now. A 4 digit password was safe for hundreds of years, but it would be a security lability today for the same reason.
As the article clearly states, privacy features have been in the census since 1990. It is just that the previously used privacy feature was not very strong and could be defeated. So it was replaced by a stronger feature in 1920. Before 1990 the census. 1990 was when personal computers were being popularized and the computing power available to individuals exploded and so then it was possible to use computers to separate out individual information from the data the census publishes. So the issue came up then.
Computers and improvements in data science/machine learning are basically the entire explanation. A LOT of the techniques that we use today to de-anonymize data require computation power not previously available. Even when doable, resources limited scale. Source: statistics degree
(Also, linkage. There are more data sources to cross reference now with the internet and social media and web tracking and hacks - the record footprint of Americans even as recently as the 70s and 80s was dramatically lower!)
Can anyone explain to me the previous state and why it was desirable? I admittedly do not understand why people are getting riled up. I am not being difficult. I really don't understand the original state and the changed state here.
So "differential privacy" pretty much sounds like someone gets to modify the results of a census and how it gets modified is entirely up to their discretion.
Seems like something that could be abused to achieve political objectives.
there are obviously measures in place to ensure the added noise is statistically homogeneous. the changes don't affect the final aggregates significantly, just enough to avoid saying much *about any individual person*.
know how you can buy "anonymized" data from data brokers and drill down until it's not anonymous anymore and in many cases point to the exact person? differential privacy would prevent that kind of thing.
If someone actually wanted to achieve political objectives by tampering with census data, there are better means than tampering with homogeneous statistical fuzzing.
Not really, it has to be random in a predetermined fashion to be considered differential privacy. It is reversible in the way that someone shouting over an aicraft producing white noise is intelligible.
I guess someone could fiddle with the noise, but then why not nudge the originals? Or more insidiously, control what is published?
Any privacy-diminishing changes at federal level happening during this administration are for one reason only: to amass more power in Conservative administration/governance. At the federal level it's Project 2025, at the state level it's making sure states stay red and disenfranchise minorities.
And a lot of countries have things like national IDs that, rightly or wrongly, given things like RealID and passports, that a lot of Americans just don't like on principle.
Sure, in Europe we don't because we already have databases of all citizens, also recording attributes like race, skin color, religious affiliation or political leaning in a database is highly illegal, both for the government and for private use.
In my European country (Switzerland), it's mandatory to notify the government whenever you move. There is thus no universal census and also no voter registration. There's still subsampled surveys though, for e.g. economic data, that might come by mail (addressed to you by name, because they know where you live!).
As I understand it, in the Soviet Union you had to get government permission before moving. Whereas here the right to move is guaranteed, you just have to update your details.
Many countries effectively have a live registry of where people live, updated to within a few weeks.
A door to door census isn’t needed because they can do something like:
SELECT a.province, COUNT(DISTINCT b.id_num)
FROM registry a
INNER JOIN national_id b
ON a.nat_id_num = b.id_num
WHERE timeframe = 2026-01-01
GROUP BY 1
Correct. At least, not that I know of. On the other hand, when you move, you must deregister with your old municipality and register with your new one. The exact system differs a bit per country.
I think it should be noted that there was a lot of dissatisfaction from users of the census data as far as I know. So it's not been banned just for politicals sake or because they hate privacy... Some people I talked to in the privacy field even called the whole thing a total disaster and weren't shy to put blame on John Abowd who apparently pushed this through despite a lot of internal opposition and concerns. Not sure if that's true, but what is definitely true is that the way the data was released produced serious issues downstream as most researchers and statisticians that ingested the data weren't prepared for receiving noisy data values. Differential privacy was applied in a way such that many invariants that data users cared about weren't preserved, which was expected as it's not possible as you can't preserve all invariants and at the same time add meaningful noise to the data. The thing is, with such a differentially private data release you need to adapt all of the downstream analyses to take into account the exact mechanism the data was altered in. And since the census bureau used a very intricate mechanism that didn't just add Laplace noise to data values but instead relied on a multi-stage process that preserved some invariants but not others it was very difficult to even write routines to account for the changes being made to the data. They essentially asked of every data user to rewrite their whole analysis pipeline based on the exact disclosure mechanism that contained a large number of bespoke choices regarding which data invariants to preserve and basically produced a mix of noisy, synthesized data that was just really hard to reason about. I don't even know if there even would've been a way to do this better, but the fact is that not every small county or school district has top-tier statisticians at hand that can just read a whole monograph on differentially private synthesized census data and then hotpatch their existing analysis systems to work with that data.
I was a big fan of differential privacy but now I think it might be doing more harm than good, as I haven't seen a single case where it was applied successfully in a problem where it actually mattered, and it contributed strongly to discrediting and preventing a lot of work on other anonymization techniques as it was deemed the only way to preserve privacy by the research community, so showing up with enhancements to k-anonymity or any other noise mechanism not rooted in it was a sure way to get ridiculed and ignored. And it's just not a practical mechanism, even when it works for a single disclosure you always end up having to blow up the privacy budget to a ridiculous amount in order to keep disclosing statistics as otherwise you would for almost all real-world data run out of budget after a few publications.
So, for me it's a technique that works in the areas where it doesn't really matter (publishing highly aggregated statistics that pose almost zero privacy risk even without differential privacy) and doesn't work in other areas where it would actually matter (publishing fine-grained data about individuals or small groups). There are some niche use cases but in my view the privacy community has really overblown the importance of differential privacy by portraying it as the only way to reliably anonymize data.
BTW the German census bureau has an interesting approach to anonymization which they use for several decades already and so far I haven't heard of any cases of successful de-anonymization of the data, maybe the US bureau should have a look at that for their own needs.
Of course there will be dissatisfaction from users of the data. Anyone that wants to use census data will prefer less privacy in the data. And anytime privacy is enforced the data becomes less useful. It would be certainly very convenient for both advertisers and gerrymandering political consultants to have detailed data on every citizen.
As the article says anytime you want to enforce privacy, the data becomes somewhat less useful, there is just no way around that.
The point of rights is that we have them and that they should not be trampled upon when they become slightly inconvenient to someone in power.
Are you sure about that? You are saying that differentially private census data couldn't be used for gerrymeandering and advertisement while non differentially private data could? Hard to believe, I'm not an advertisement or gerrymeandering expert but I would assume people running ads or cutting up districts are mostly interested in aggregate statistics i.e. they won't care about single households? And I would assume they can rely on voter files, party databases etc... And to the contrary there are reports [1] that indicate differential privacy actually makes gerrymeandering analysis more difficult or impossible. So, not really an argument for differential privacy, discriminatory action can be equally well taken based on differentially private data as the government cares about groups not individuals and groups aren't protected by differential privacy. It seems people really fundamentally misunderstand what this technique can achieve and what it won't do.
> serious issues downstream as most researchers and statisticians that ingested the data weren't prepared for receiving noisy data values
They weren't prepared for data that was obviously noisy. The data has always been inherently inaccurate, and folks just chose to ignore that previously
No, there are dozens of articles discussing the mechanism and explaining the impact it had in different areas e.g. [1,2,3]. And the release mechanism wasn't just "add noise", far from it, you may read the original paper [4] to see how intricate it was, anyone wanting to make real use the resulting data would have needed to understand that approach in detail to work with the resulting data. The report of the national academies [3] is probably the most comprehensive analysis of the mechanism and the complications it introduced, so writing "it has always been inherently inaccurate" is just wrong, this new mechanism was way worse than just introducing unbiased sampling noise.
The dueling political demands of accuracy and privacy are simply incompatible at some level. After reading this, maybe Hanlon's Razor isn't the right standard. Besides malice and stupidity, there is impossibility. Some problems just aren't solvable under certain constraints. I don't envy the statisticians tasked with finding a politically palatable solution to a math problem.
But the strength of differential privacy is that you can now make this tradeoff explicit and quantify it. I always liked it because it offers a mathematical solution to a policy problem, but then of course it's up to us to decide what parameters and tradeoff to choose. Also, some data might just not get published at all if the privacy implications are too problematic, so differential privacy might buy you more signal!
Yeah, the main issue with differential privacy is that you need competent government officials making decisions who understand math beyond a high school level.
There's a ton of information in the US that is accessible to various degrees--especially through the the deep web much less background investigations. Unless you're a wealthy person who can set up various levels of trusts you can't really hide them.
You can of course disagree about what what should actually be part of a transparent public record. (Though I suspect a lot of people post-date what was generally available in a "phone book.")
I really have to take the anti-noise side here. I get why it's a hard problem, and I get why the Census Bureau thought this was a neat solution. But I'm imagining an accountant stepping through a similar chain of logic:
* I want to accurately report the finances of our company to the best of my ability.
* But that report would allow people to reconstruct private data about the terms of our contracts with various counterparties. I'd really like to avoid that, there's no rule that says we're supposed to release that data. In fact some of those contracts probably came with nondisclosure agreements!
* So here's what I'm going to do. I'm going to calculate our results to the best of my ability, and then I'm going to add random values to them and report only the randomized ones. Any reconstruction people try to do will be wrong because of the randomness.
* If the SEC says "no, you need to report your actual numbers", I will explain to them that there's no such thing as an actual number because all data is noisy.
This is a gift to reactionary gerrymandering and voting restriction efforts, along with things like yesterday's FBI raid of an Ohio voting rights organization.
Representative Beatty serves her own interests and her involvement Kennedy Center naming was just more of the same performative politics she routinely engages in. She's on the verge of being an octogenarian and missed a number of key votes, like the bill that cut funding to NPR, PBS, and other govt. programs. Kudos to her for working to remove Trump's name from the Kennedy Center but she needs to go.
headcount doesn't have to be granular, it has to be accurate. this is about the very useful street- and block-level data.
also, if how would anyone know how accurate the "transparent" number is? if Trump or Thiel can fuck with the fuzzing they can just as do so with the base data.
Frankly i see no reason to keep this data private. They should simply publish a full dataset of the census, with no such data coarsening/differential privacy/ etc...
Fundamentally this is public data. If it's to dangerous to make public, it's too dangerous to collect, and people should be aware of exactly what it is.
There are very few things that the state has data on that should not be made public. Census data is simply not one of those things.
publishing should be the default for any data, and to keep it unpublished should require substantially good reasons that impact the country as a whole. Frankly, if it isn't detailed national defence plans, i struggle to see any data that should not be public.
The biggest challenge with running a census is getting people to trust you enough to answer your questions.
A lot of census questions are sensitive. The ACS covers topics like citizenship status, disabilities, income, SNAP assistance, languages spoken at home.
If you want accurate information about the people who live in your country you need the census process to feel as safe for people to respond to as possible.
Are you saying the census shouldn't collect any data that people wouldn't be comfortable publishing? Because that's a recipe for a census that is far less useful for helping the country make useful decisions.
> Are you saying the census shouldn't collect any data that people wouldn't be comfortable publishing? Because that's a recipe for a census that is far less useful for helping the country make useful decisions.
I'll say that. The state representatives should provide congress and the president any data needed to inform policy decisions about the people they represent. And as others have pointed out, other departments and agencies (such as the IRS) have most of the rest of the data required to make policy decisions.
Except for gerrymandering purposes, I fail to see why income, party affiliations, etc., is useful for the purpose the census was created for.
>And as others have pointed out, other departments and agencies (such as the IRS) have most of the rest of the data required to make policy decisions.
There are laws in place forbidding government agencies from merging together datasets.
The last thing people should support is creating of profiles of individuals by combining data from different government agencies. This is why the census is so important as a data collection mechanism.
> There are laws in place forbidding government agencies from merging together datasets.
This is an excellent point. In my opinion, such laws are a good idea. Most of the time, policy decisions should not require IRS data. (Or other personal data.)
But to get around such laws, the government asks citizens to provide that data a second time (in the census). And sometimes it's asked yet again on other forms. This seems to defeat the purpose of those laws.
I can see that federal disaster aid might need to know if some area needs more or less aid, depending on the wealth of the area receiving aid. If aid is given to individuals, the have a need to know the individuals' income.
When there is a reasonable need to know, I would prefer the government use the much more accurate IRS data, rather than ask for people's income multiple times. The laws preventing merging federal datasets could be rethought, given what is now known about preserving privacy mathematically. I would like to see specific exemptions made, with the provided data properly anonymized to preserve privacy while serving the legitimate purpose for which the data was requested. The use of such data should require a request to congress for it.
This seems’s like an issue created by congress. the constitution only requires a headcount by state. Maybe they should use another mechanism to collect demographic data. Since the concern is not about representation, but allocation, tax returns seem like an obvious alternative and they are already private and collected at a much more granular level.
Have you filled out a federal income tax return in the US?
It absolutely asks for the names (and SSN) of any dependents. It's trivial to infer whether one of the adult(s) filing the tax return gave birth in the last 12 months based on the last 2 years of tax returns for those adult(s).
The census isn't for helping the country make any decisions other than determining the number of representatives and apportionment of taxes. It should not be collecting any data that isn't necessary for that.
> The actual Enumeration shall be made within three Years after the first Meeting of the Congress of the United States, and within every subsequent Term of ten Years, in such Manner as they shall by Law direct.
The key thing you're missing is "in such Manner as they shall by Law direct".
Congress has passed a whole bunch of laws that attach additional responsibilities to the census for the purpose of supporting government decisions.
I'd like to know when they stopped publishing census data. I have used it for genealogical purposes to track ancestors: you can see exactly who was living in which house, how they are related, and what their ages are (I found that women in my family often reported, both on the census and marriage documents, being younger than they actually were). I don't think I've seen data from after 1950, though.
I don't understand why the census would include SNAP data or income: surely the government already has that information. I have never doubted that the IRS knows my income better than I do. Maybe better use of existing datasets could restrict the census to less invasive questions.
They haven't stopped but they don't happen immediately.
Detailed census records are published 72 years after they were collected; the last release (of 1950 census data) came out in 2022; the next one should be published in 2032.
The Census Bureau is a lot more than the 10-year Census, and it already makes very extensive use of IRS data and other administrative sources. Virtually everything that is published using these sources uses either differential privacy or other privacy protection methods that are prohibited by the order. I'm guessing that a lot of those pieces of data are just going to be put on hold until the order is reversed or weakened. A number of things might have to go away permanently, as there's almost certainly no way to protect privacy in them without some kind of noise infusion.
TBH I don't think the people who wrote this knew how much collateral impact it would have.
Replying to the ACS with accurate information is required by law, so they don't actually need to rely on people feeling safe to get answers.
I don't trust the Census Bureau with my data, so if this is as "dangerous" as the author and some people here seem to think, they shouldn't be collecting it in the first place.
There's not many cases of enforcement. Non-response is taken about as seriously as the Robinson–Patman act. I think the Census Bureau is very reliant on people thinking there will be enforcement, however, which is why the materials they send all have a threatening aura. I don't know about the ACS, but for the decennial census I often felt like my job as an enumerator was just to bother people until they'd answer. The case would keep being recycled until we got at least (IIRC) a head count.
They can certainly enforce that you answer the survey. But it's very difficult to enforce a requirement that people answer questions accurately, particularly when they perceive that doing so will expose them to danger.
I don't get what danger is being referenced here that exists only if the data is released to the public (in aggregate)?
The government is the primary and arguably only source of the danger, and they already have most of the data whether you answer the ACS correctly or not.
1. People give the information to the government under the expectation that this data is to be kept private or used in such a way that individual targeting is made impossible, you break that expectation and people will lie or won't give you this data.
2. Without noise injection it's rather simple to do statistical attacks to reverse engineer individual entities.
3. This data is and has already been used in the past to undermine democratic systems by targeting and disenfranchising minorities, as well as gerrymandering the US to hell.
4. "Too dangerous to make public, too dangerous to collect" - this is a false dichotomy. To govern effectively you need sensitive data, but it should be collected and used in a way that's safe for the individuals.
5. Macro level aggregates don't need individual exposure, that's why noise, anonymization and statistical functions are fine.
>If it's to dangerous to make public, it's too dangerous to collect, and people should be aware of exactly what it is.
While this may be a reasonable stance in theory, there are many examples in reality where the danger has not materialized for decades. Personally, I have access to health records, birth certificates, and death certificates collected by a state. They contain very personal information. As far as I know, they have not been leaked to the general public.
This is one of those situations where everything you hear tells you the system is failing, but that's because nobody talks about the systems which haven't failed.
Besides, this possible failing of the Census' privacy promises shouldn't convince us that "If only we hadn't given info to the despotic and cruel government using it to target people, then we'd only have a despotic and cruel government hurting people randomly." The solution to this problem isn't to withhold info, it's to get rid of the despots.
That's a good default position, and I think should be our starting point.
But the devil is in the details. If we don't want advertisers constructing semi-complete profiles from simple web interactions then why would we publish 330 million census questionnaires for their use?
So do you believe that individual income should be public? Or do you believe that the government should not take income into account for taxation or distribution of benefits?
Then dox yourself right now with your previous census answers and PII. There are several obvious reasons to keep the data private, all you have to do is use your brain.
But why is the census asking about those attrbutes at all. The Constitution requires a count. That's it. A number. We don't need to know the rest of it, or if we do, it should be surveyed separately with voluntary participation.
> We don't need to know the rest of it, or if we do, it should be surveyed separately with voluntary participation.
But we do. A detailed census is essential for making good policy. For example, knowing the age and distribution of children across the country helps local and state governments decide where to put the next school or children's hospital. The federal govt. allocates funds for education and daycare accordingly.
The census is the best and most important measure of govt. policy. Taking it away would leave everyone worse off.
The risks of abuse are too high and historically proven to happen eventually. There are many other ways to determine where schools and hospitals are needed, such as aggregate enrollment and admission statistics.
Local school districts know where they need more or fewer schools. This sort of thing isn't any business or responsibilty of the federal government at all.
Census participation is not voluntary. Failure to provide complete or accurate data is, in theory, punishable by a fine. Last census, I intentionally provided incomplete data on the web form, which resulted in a person with a clipboard and some stern questions showing up at my door.
It’s because people are significantly more likely to lie or omit some facts if you don’t guarantee their privacy, which means your census data ends up being worth less than a pile of shit.
The alternative is to water down the census questions, which also leads you down the same path (i.e. manure as data).
So you seem to have at least a surface level of understanding of incentives.
Check this then:
If the census is responsible for allocating federal funds and congressional apportionment, what are the incentives for making census data private and encouraging people that would otherwise hide their identity?
And you seem to not realize that a census has a much wider impact than allocation of federal funds. It’s a nationwide survey done once every 10 years. No other survey compares in scale.
Now think about the data you could collect and the decisions you could make based on this data to ensure a better future for all in this country; in fact, this is a stated goal of the survey that you either didn’t know about or are willfully ignoring.
On the flip side, think about the repercussions of tainting this data and basically wasting such a valuable chance that won’t come around again for another 10 years.
We also know that this premise is simply wrong, Census is statistical survey, no party in the world is legally allowed to inspect the contents of the individual form via Title 13.
Counting illegals is not possible under the Census currently or in any point in the future most likely
First off the census is used for determining how many seats are used for congressional apportionment and allocating federal funds.
So unless you're willing to also say that counted illegals cannot used for either of those, then you're just being obtuse.
But if we can agree that they cannot be used for that then sure, lets identify and count them. If we can't identify (make non-private) and count them then why should we trust that those counts are accurate?
The replies here arguing we should publish it all are wild in the worst kind of first-order thinking way.
It’s a census: it just asks questions.
If you start publishing and weaponizing the data against people with various attributes, they’ll just lie or not answer. And then you are left with worse than nothing: bad data people try to act on.
The US Government is the entity that weaponizes the data. The most obvious example is the Census Bureau compiling lists of people of Japanese descent to imprison during WWII. That's just the most obvious one that I know of without looking up more.
The real push for this now is to form lists of people to disenfranchise.
yeah,
and implicitly force them to sell the land they own for less then it's worth, which in combination with setting up very messed up tax related laws in some states (1) which highly benefit you if you bought land longer in the past effectively "killed" a budding, wealthy, land owning Asian community and made sure it can't really regrow in that form.
(1): I think it was mainly California, but don't remember full
Canada also did this to its Japanese
Remember “leftist “ and transgender activists are terrorists now.
First they came for…
What's the actual antidote to this? 5calls.org?
Voting and getting everyone you know to vote
And civically engaging. Less than a fifth of voters regularly contact their electeds.
The people in power now are doing everything they can to make that as hard as possible by any means necessary. Good luck.
No, they are doing everything they can to ensure that only the people who are legally allowed to vote are the ones voting.
You should carefully examine the evidence you have supporting that belief. Start by observing that this is a partisan issue in which the official positions of the two major parties disagree on a factual claim, not merely the policy. A disagreement on policy can (sometimes) be chalked up to a difference in values, even though those do sometimes arise downstream of factual incorrectness. But a disagreement on facts is one with in which someone is right and someone is wrong. (Or, more complicatedly, someone is closer to accurate and someone is cherry-picking.)
If what you believe to be true is in fact true, then you should be able to comfortably go searching for evidence to falsify it and support the alternative, and fail to find such evidence, confident in your assumption that you won't find it. Either way, I hope that you desire to find the correct answer rather than the one that would be convenient for your political position.
> they are doing everything they can to ensure that only the people who are legally allowed to vote are the ones voting
Illegal voting is so rare that almost every time folks go looking for it they come up empty handed. Examples of voter suppression, on the other hand, are trivial to fine. (And both parties do it, particularly around primaries.)
In my state, we’re trying to enact a citizenship-proof requirement which penalizes women who change their name on getting married and those who can’t afford a passport. In effect, a marriage and poll tax. Ironically, this will disenfranchise the MAGA voters who are themselves pushing for it, but I’m not really going to point that out aggressively.
(That said, a legitimate fraction of American politics right now is in convincing the other side’s likely voters that elections are rigged, the oligarchs are in charge, why even bother calling your electeds or voting, eat an ice-cream sundae and talk to your AI girlfriend.)
If someone mails in my absentee ballot and I don’t complain, how do you detect that voter fraud?
Or if someone knows their friend is sick and votes without an id, how do you detect that?
It seems like there are currently many ways to vote illegally that don’t get detected.
Illegal voting is "rare" because the system is set up so that it is in most cases impossible to detect.
Turnout in the past few elections was already extraordinarily high. Clearly this isn’t the answer. And neither is protesting, considering that 4 of the 5 largest demonstrations in US history happened in the past 10 years and achieved nothing.
Well right... you also need to vote for the correct ("less-bad") people and get your friends to do the same.
Voting for the worse people makes things worse.
Ranked Choice Voting makes it easier to vote for “less bad” candidates.
RCV also tends to work against polarization, since it rewards candidates who are at least acceptable to a broad swath of the electorate.
It may not be the “answer” for all that ails the American political system, but it would help.
ETA: Unlike many other reforms it's also doable within the constraints of the current constitutional order and is hard for SCOTUS to torpedo (though I suppose I shouldn't underestimate SCOTUS).
Approval Voting would be an easier pill to swallow for most americans. It’s hard to explain “yeah Trump got the most #1 votes but still lost” and easy to explain “this other candidate got the most checkmarks”.
https://www.rangevoting.org/CompChart.html#votsysts
You also need the option to vote for "less-bad" people. Where I am now, my vote doesn't matter, even if it means the "less-bad" people win with no competition (as opposed to where I moved from where things were skewed the opposite way).
Vote strategically. Candidates notice btw.
So if you're in a heavily red state but you're blue, then vote in the primaries for the more centered Republican. If you're in a heavily blue state but red, do the opposite. Either way this actually helps because more people are centered and we're getting wilder and wilder candidates because there's increased tribalism. They go to the extremes because they get more voters that way. They figured out that the mainstay voters will just end up voting left or right regardless, and that by catering to the extremes it actually pulls the mainstream voters too. (Both Reps and Dems are using this strategy)
Remember, you don't have to vote for the person you actually like.
And keep doing this until we get a sane voting system which can embed actual preference (any of the cardinal systems: i.e. Approval or STAR). This strategy still works with ordinal systems (i.e. Ranked Choice) because a weak spoiler is still really good at splitting the vote (happened in a pretty famous Alaska election).
Somewhere between just going out to vote and revolution sits moving to an area where your vote counts.
I’ve not quite reached that threshold, but I avoid moving to DC due to the lack of voting rights.
Protesting one weekend day every 6 months will obviously do nothing. The pressure needs to be non stop
> Turnout in the past few elections was already extraordinarily high.
In a sense, this in itself is the issue. It's long-term _worse_ to vote for the "lesser of two serious evils". This extreme "long-term pain for short-term gain" attitude is what's gotten the US to where it is. If in 2016 of 2024 even 20% of the dems would've stayed home or voted third party, the DNC's continuous forcing of awful corpocrats with zero charisma would've become completely untenable and Trump would've been limited to one term. Yet instead they were rewarded for it, so you'll see Newsom get the candidacy and presidency in 2028 (if 2028 even happens at this point), and then in 2032 you'll get something like Hegseth or Thiel winning and it's all over.
There is an answer: relentlessly vote, but only for candidates who are actually slightly decent - including third-party - and otherwise stay at home. "Relentlessly" means "at every level", including locally from the very bottom, all the way up.
The whole idea of "third-party voting is a complete waste in the US" is incredibly dumb because a vote for someone who loses isn't a wasted vote. It shows the others that there's a voter there who can be convinced if catered to, if they select a better candidate. The powers that be have done a fantastic job of brainwashing the entire population of the myth that anyone who _doesn't_ go out and vote for either major candidate is a morally bankrupt person, because it directly benefits them.
The reply to this will be "well it's too late for that now!". It's wrong because the alternative doesn't help you one bit. You're just wishing for a miracle, that in 4 years something happens, kicking the can down the road making things worse long term. And that's actually what's got you here.
It's a symptom of the terminal disease which has infected all layers of American society and has gotten it to where it's at: short-termism. Everyone just looks at the next quarter, the next election. China's ascendency is 1:1 tied to doing the exact opposite. Some smartypants will now point "but zero Covid", great you found a potential exception, now look at the other 90% of policy.
Every time I've explained this I've gotten instantly downvoted without a single reply making an argument against it, because it's too painful for people to admit that they've been part of the road to where the US is at. And again, short-termism: rather feel the short-term tiny dopamine hit by slamming that downvote button than thinking about it. Let's see if this happens again.
> The whole idea of "third-party voting is a complete waste in the US" is incredibly dumb because a vote for someone who loses isn't a wasted vote.
Yes, but with a caveat, if you had a strong preference between the top two actually-likely-to-win candidates (assuming the third party wasn't competitive), it's at least not voting the most in your interests for the outcome. Which is why we really need approval voting, so we can actually vote for the candidates we like, rather than needing to "strategically" hold our noses.
But I agree with the rest of it, if none of the candidates represent you, the third-party vote at least allows you to send a signal of "I vote, but you need to make me want to vote for you, and this is what I want".
> Which is why we really need approval voting, so we can actually vote for the candidates we like, rather than needing to "strategically" hold our noses.
Approval voting would not end strategic voting.[1]
[1] https://en.wikipedia.org/wiki/Approval_voting#Strategic_voti...
> Every time I've explained this I've gotten instantly downvoted without a single reply making an argument against it,
Ok I'll break it down for you.
> If in 2016 of 2024 even 20% of the dems would've stayed home or voted third party
Parties cater to their bases, and putting yourself out there as an unreliable voting bloc is exactly how you get your demands ignored.
> The whole idea of "third-party voting is a complete waste in the US" is incredibly dumb
It's not incredibly dumb, it's simple mathematical reality. This doesn't change unless the first past the post system changes. Why do you think the GOP backs the Green Party?
> Parties cater to their bases, and putting yourself out there as an unreliable voting bloc is exactly how you get your demands ignored.
I wish more people understood this. Instead, there's this mistaken notion that you give away leverage by supplying votes. It's literally the opposite.
Your coalition will have more influence and leverage within a party by supplying votes, not withholding them.
> 2024 [...] the DNC's continuous forcing of awful corpocrats with zero charisma would've become completely untenable and Trump would've been limited to one term.
You mean the 2024 election cycle where incumbents all around the globe were beaten because the economic situation was strongly anti-incumbent? Are you positing that the US election was somehow a unique outlier and solely down to Harris being the Democrat candidate? Even though a swing of 115k votes would have handed the presidency to Harris instead?
It sounds like you have a particular issue with the 2016 and 2024 elections and I'm wondering if there's something in common that might explain it...
Democratic candidate, not "Democrat candidate".
https://en.wikipedia.org/wiki/Democrat_Party_%28epithet%29?w...
Hillary Clinton won the popular vote. People didn't want Trump in 2016, they wanted her. But he won the electoral college,
He won the popular vote in 2024, but the tight margin in the electoral college suggests a democratically elected Democratic candidate (i.e. one selected by a primary, not one appointed by the sitting president) could have won instead. Other potential candidates were polling better than Harriss. I personally think Gretchen Whitmer could have successfully distanced herself from the Biden administration and defeated Trump.
> Every time I've explained this I've gotten instantly downvoted
Because it's dumb. People don't want to hear dumb ideas, or take the time to try and convince someone that would spend however long it took to type that, apparently multiple times, without realizing it. Throwing away votes will never be the reasonable thing to do. I know you don't want to hear that, because it's too painful for you to admit there's no simple answer.
Please, explain to me how voting for a candidate you don't like is not throwing away your vote, but voting for a candidate you support is.
> The whole idea of "third-party voting is a complete waste in the US" is incredibly dumb because a vote for someone who loses isn't a wasted vote. It shows the others that there's a voter there who can be convinced if catered to, if they select a better candidate.
Tried that in 2000, voting for Nader as a protest vote against Clinton/Gore third way neoliberalism. I did that in a state where the electoral votes for Dems were 100% safe. Still just got blamed for Bush and there was zero self-reflection on the part of the Democratic Party.
...
I would urge everyone to stop fixating on the Presidential vote as the only fight to win and everything being win/lose based on that outcome. If the Congressional Progressive Caucus in the House exceeds 50% of Democrats in the House, then we can start thinking about a world where e.g. AOC might be the speaker of the House rather than Nancy Pelosi.
> It's a symptom of the terminal disease which has infected all layers of American society and has gotten it to where it's at: short-termism. Everyone just looks at the next quarter, the next election.
Yeah, and the Office of the President is 4-8 years and is just more short-termism, along with individualism / cult of personality / CEO-leadership. If you want to make lasting change in the DNC, start by flipping more and more House seats to progressive from neoliberal.
The legislative seats are barely more malleable than the executive ones, and they’re a lot cheaper to buy off. Even with grassroots efforts to elect local candidates and move them up, it takes a perfect storm to actually get someone that’s even modestly different than the empty suits that largely fill those seats already.
I have zero faith in this system to execute anything other than purchased policy agendas, or empower any more than a tiny symbolic collection of people who oppose them… just enough to give the illusion of agency and stop any real organizing. I have no idea what could possibly break this pattern.
The Republicans were successful with the Tea Party in taking over the House and the Presidency, that's a model which I'd argue is really proven to work in our two party system because we all just literally watched it play out in real-time.
Arguing against that, probably comes from a cynical neoliberal perspective where the Democratic Party can't change because the argument assumes that the Democratic Party can't change.
And the alternative is definitely outright fascism and the suspension of Democracy. They've told us what they're planning on doing, just like we knew they wanted to get rid of Roe vs. Wade, we just accepted the lies about it being settled law and a political football.
If you're not willing to vote against that, then you're comfortably middle class and don't think you'll be one of the ones that are going to be hurt.
I've voted against Trump 3 times and threw money behind trying to get Sanders the nomination in 2020 instead of Biden, so when all the horrible stuff has been going down this term I don't have to tie myself in knots with rationalizations about my actions.
You are saying the candidates are forced on us by someone else. But that's just wrong, we choose the candidates. Anybody can run, there is no secret cabal that decides who can run and who can't.
Didn't happen last year when Kamila was selected by the leaders.
But in normal years candidates are successful because of the amount of money they can raise. The more they can raise the more brainwashing ads they can buy. The non so secret cabal is the donor class.
Anyone can run? You must meet requirements on age and how long you have lived in the US. You must pay fees and provide signatures for each state. If doing it through a party you have to meet their rules.
Cost to get on most states ballots at a basic level is a million. You could do it for free if you dont want to appear on any ballots.
The ideal is that anyone can run, but it's not that easy to just start an independent campaign that has a decent chance of winning. Local races are the most realistic "anyone can run" arena, but once you need a lot of travel and logistics in a large region, you either need a lot of your own resources or the support of an existing large political organization.
You do know the former head of the DNC was forced to retire after the leaked emails outed her, and basically all of the top of the DNC, extensively conspiring against Sanders in favor of Clinton? [1] You're right the cabal isn't secret - it's literally the DNC, and who they want to win is who will win, one way or the other. Just reading over that source - it's insane how blatant these people can be:
"In May 2016, MSNBC's Mika Brzezinski accused the DNC of bias against the Sanders campaign and called on Wasserman Schultz to step down. Wasserman Schultz was upset at the negative media coverage of her actions, and she emailed the political director of NBC News, Chuck Todd, that such coverage of her "must stop". Describing the coverage as the "LAST straw", she ordered the DNC's communications director to call MSNBC president Phil Griffin to demand an apology from Brzezinski."
[1] - https://en.wikipedia.org/wiki/2016_Democratic_National_Commi...
I mean, why wouldn't they? Bernie is not a Democrat, he's independent. Winning elections for their own party is the whole reason DNC exists.
A primary role of political parties has traditionally been to filter candidates. It's literally their raison d'etre
Sanders wasn't even a Democrat. He switched solely to run in the primary. It's neither scandalous nor surprising that the DNC would try to put up barriers between him and the nomination.
If the RNC had done it's job, Trump would never have been allowed into the primary in the first place.
The trend has been going that direction, where low turnout elections favor Ds and high turnout favors Rs. But only kind of... that holds when Trump is on the ballot. Trump seems to activate a segment of low-info, low-propensity voters who stay home when their guy isn't on the ballot. Things will probably get scrambled again once Trump is gone though.
And don't discount protests. It's crucially important to have big public and forceful displays of united opposition. The regime is unlikely to be toppled by protests, but they will weaken it.
That really matters.
In an authoritarian take-over institutions are the front-lines, not the masses. Think colleges, media, industry, courts, legal firms, local governments, etc. The dilemma those institutions will face is to follow rule-of-law or submit to authoritarian corruption. Authoritarians win when those institutions decide it's safer to submit than it is to follow the law. And when institutions (and the people within them) feel like they are twisting in the wind alone and nobody cares, they are more likely to buckle. Protest movements help reinforce the rule-of-law side of that calculation.
(The rise and fall of Orban is a great lesson on all of this)
Also see: https://essayx.substack.com/p/the-35-percent-rule-just-made-...
Definitely not, you may, at best, shift the problems to someone else. Both of "our" political parties are beyond redemption and cannot be reformed (if they were very not terrible in the first place). The only thing that will change outcomes is direct action and I'm including limitless scaling of that including the armed defense of your ideals.
Violence. But people are still pretty comfortable so here we sit.
The actual effective strategy is “tit for tat”
Making the wealthy scared
I'm pretty sure they didn't think this through in a comprehensive fashion.
Because making it esy to find all the rich people just seems like a very bad idea given the direction things are going.
When it was broad, the only thing you could do was locate, say, large minority groups. Blacks and latinos for instance. And even that led to problems. I can't imagine what will happen when we can drill down and tease out immigrants from citizens. Gay from straight. Rich from well to do. And so on.
Do you really need the census to find people of specific demographics in 2026? Pretty sure I can go up to anyone in any state and ask where all the Puerto Ricans live and get an answer (in many cases I'm sure I'll get stared at like I'm crazy, but that's still an answer). I know because my parents moved to predominantly Hispanic parts of Florida before fully settling down where we landed, I REALLY doubt they stopped to pull up census data to decide where to find Hispanics / Puerto Ricans in Florida. You can talk to any local of any area and figure out which areas are a specific nationality without census data.
Does anyone actually believe this crap?
You think the census is what the government would use to mass identify and imprison people, not the NSA database(s)?
You think homeland security, or the FBI, or any other alphabet agency doesn't already have access to a giant list of people?
Think about what meta knows about everyone, or Google. You do realize that the US gov has read access to their core databases right?
"The census" has absolutely no bearing on any of that which you're worried about.
It's just shocking the level of ignorance that gets upvoted in the comments here now.
You think they wouldn't use every tool available to then, including the census data?
The census data is probably too stale in most cases to really act on in the way people expect. They used it in the 1940s because that was all they really had. But today it would be stupid to use when you already have backdoors into cell and internet service.
> Think about what meta knows about everyone, or Google.
Not everybody uses it and not everybody who uses it uses it naively enough to give access to useful identity info.
What's shocking is how people keep finding excuses. "what about Meta" is not one
Yeah okay fair, I was about to post a knee jerk reaction, but it's well known that the US government can obtain higher quality data by just simply buying it from the public market.
> Does anyone actually believe this crap?
> You think the census is what the government would use to mass identify and imprison people, not the NSA database(s)?
I think, and history shows, they would use the tools at their disposal.
Example: https://stateline.org/2026/01/20/ice-is-using-medicaid-data-...
They haven't done a single thing without malicious intent. Go back and find whatever else you've defended in the past, and look at the results instead of the stated reason/goal for doing them. They won't match. They'll be opposites. You'll rationalize or shift blame, of course. But maybe this time, something decent will get through.
I'm not sure why your comment is grayed out.
Cell tower data, credit bureau integration, social media scraping, palantir, smart home device surveillance, DNA database exploitation, facial recognition networks, tax, payroll, passport, visa, medicare/medicaid, immigrations and customs databases and many more...
The census is a historical relic used to jerrymander congressional seats, and that's about it.
Census data provides a reliable source to build off of, which makes joining between data sets more reliable. A lot of what you're talking about would be partial prints of an identity that have to be joined up with others to give reliable data.
Eg
> Cell tower data
That's just going to get you a subscriber and device ID, unless you're talking about going deep packet inspection and parsing the contents of the packets. You could, but that's a lot of effort to get something the census can hand you for free.
> credit bureau integration
Notoriously unreliable and identities for the purpose of credit get stolen constantly. The easiest way to clean that is against known-good info, like the census.
> social media scraping
Half the profiles are fake, also not reliable data unless you clean it up. Again, census data makes it very easy to cut out profiles that don't match a real person.
> tax, payroll
These are probably fairly reliable, although they usually won't tell you about a person's demographics.
> passport, visa, medicare/medicaid, immigrations and customs databases
There's an enormous part of the population that won't appear in these at all. The huge part of the country that's "working poor" but not poor enough for Medicaid probably aren't traveling internationally. I wouldn't be surprised if half the country doesn't appear in any of these.
The census has value in that it contains a huge depth of information, is tied with your identity, citizens are compelled by law to answer so even the privacy folks have to respond and lying on it is a crime (enforcement is probably non-existent, though).
I'm sure that can all be reconstructed to some level of accuracy given sufficient effort, but that's a lot harder and requires a ton more coordination than "SELECT * FROM census_data WHERE ..."
I have to agree. I'd like Census data to be private, but the cat is out of the bag.
I'm all for keeping all of this data private. But to think it isn't already available is a bit 'head in sand'. Maybe put laws in place for 'general' privacy across all data, before getting too inflamed about Census in particular.
Even at a quick glance this doesn't make any sense. The census is literally how they get the data. Where else would it come from? Drones? Every computer being hacked Michael Bay style?
Data mining companies?
Don’t forget there’s an entire industry that exists solely for this purpose.
I don't work in this industry so I don't know their secret sauce, but I would be surprised if census data is not used as a baseline for what they're selling. It doesn't make sense to not want to use it if your next best sources are relying on everyone in the household having an app that sells their location to your network constantly. I see outdated data about me on the public versions of these sites all the time, so I know they don't have omniscience.
It is probably not a baseline for what they're selling.
https://www.census.gov/about/history/bureau-history/agency-h...
> Title 13 provides the following protections to individuals and businesses:
> Private information is never published. It is against the law to disclose or publish any private information that identifies an individual or business such, including names, addresses (including GPS coordinates), Social Security Numbers, and telephone numbers.
> The Census Bureau collects information to produce statistics. Personal information cannot be used against respondents by any government agency or court.
> Census Bureau employees are sworn to protect confidentiality. People sworn to uphold Title 13 are legally required to maintain the confidentiality of your data. Every person with access to your data is sworn for life to protect your information and understands that the penalties for violating this law are applicable for a lifetime. Violating the law is a serious federal crime. Anyone who violates this law will face severe penalties, including a federal prison sentence of up to five years, a fine of up to $250,000, or both.
I worked for Data company for a year. We absolutely used Census and ACS as baselines and checks. In fact, there was some talk about getting rid of ACS in Congress and we got emails about "EMAIL YOUR CONGRESSIONAL PERSON, DEMAND ACS STAY. Here are talking points."
I am indeed sworn to not reveal lots of data I knocked on doors for. My memory isn't that good, especially compared to the database it went into, anyway.
I hope it's not a baseline for individual records, but my assumption was that the census data would be pretty useful as a baseline for aggregate information, especially when it comes to comparing to private sets they're working with.
You’d be surprised how bad/incomplete the data usually is and how much work it takes to make those datasets really useful or reliable.
That’s one reason Xoom Info was able to sell for a billion dollars and even their data has a lot of junk
Having well curated detailed census data would be a major boon for the data brokers
Disenfranchise who? Something I find interesting is that Europe and the United States are experiencing the largest migration in human history yet no one talks about this. Taking a step back, looking at it from a systems perspective, the migrations are working as planned. Designed to subvert Democracy. What does it mean to have a vote when you import 10 new voters who disagree with you?
How do you get the idea that migration is "planned"? You've lost the plot entirely
One example is HIAS who runs a fully staffed Darién Gap facility. As soon as there is a friendly administration to mass migration - they just flip a switch and we will have 10 millions of migrants. I don't want to come across as MAGA, because I am a non voter - the Republicans fuel the fire with forever wars. Its really two sides of the same coin. Companies benefit, because when there is no social cohesion workers are less likely to Unionize.
Someone must be funding the NGOs organizing it all.
George Soros, probably
As planned by whom?
The Epstein class. Bankers, Zionists who lost their civilization and Temple 2000 years ago:
A good google search to view art owned by one family: "Artist funded by de Rothschild's depicts racial violence and rape themes"
Immigrants don’t get a vote in many countries
For example Europeans were excluded from the Brexit vote
1.) Migration is constant and big topic.
2.) Democracy happens to be destroyed by local far right movements, composed of people who were there for years and did not migrated anywhere.
The extend of foreign destruction is Vance trying to destroy democracy in Europe openly, Putin doing it secretly and Musk openly enciting pogroms. None of them immigrated to EU.
The real question is why anyone answers these questions in the first place? I just wait until a census worker shows up and tell them how many people live at my domicile. It's needed for proper electoral representation and absolutely nothing else.
You first gather the data while people don't know or care. Then you weaponize it later. It happened at least once not long ago in another country, seems not overreaction to be concerned about it
It happened a year ago in this country, with IRS sharing data with ICE (breaking a longstanding policy of keeping taxpayer data private within the government).
If this is a Nazi reference, Census data was used to send people to concentration camps here during the same era. Less awful than death camps, at least.
Imagine the weaponization possibilities when combining the census data with Amazon’s and Meta’s data, and possibly several other datasets readily available to this administration. Whatever is missing from one of them can be inferred or defined from the others. This might already be happening, it can’t be checked. Some (former) dictators would be salivating.
The easy solution is to just reduce the resolution and scope of the data to the degree it is absolutely necessary. The census exists to inform representation decisions. All other concerns are addons. You can have all the data on the county or voting district level and strip data as you increase your resolution, to the point you only keep population number at the neighborhood, block level.
Knowing the racial, ethnic and socioeconomic background of the residents of a single building block is only useful to discriminate against them.
Demographic information is useful for medical, financial, educational, and so many other items.
The current admin doesn’t need it to discriminate, you can just access cameras and license plate readers and target easily that way.
The purpose is to scare people into misstating or obscuring data to reduce total house representation for an area. It’s to win votes, there are much better ways to do all these things than use this data, but effecting the vote with limited impact is a huge money savings.
There are plenty of other uses - knowing where to build stores to serve your target market, predicting possible pandemic vulnerabilities, etc.
This might be the point. As long as they think the people who end up under-counted are not people this government would like to have voting power for the House of Representatives.
Yes.
Extremists or in general any fraction willing to engage in systematic discrimination, harassment, terrorizing or similar love highly detailed non anonymized census data.
Why?
Because it gives them the perfect layout for which areas to harass (areas likely to yield), which to brutalize (areas unlikely to yield or from especially "hated" people), which to best not touch which (areas with too much influence/money or likely to contain hidden sympathizers), which to systematically take apart through other means like building a highway through them (e.g. "hated" communities to strong/connected to brutalize). etc. etc.
All of this has a lot of history weather it's from right extremists like fascists or left extremists(1).
At which point the question is, if the data you collect is that abuseable. Should you even collect it? Is it even really needed?
(1): Like actual left extremists, the a lot of US sources have the habit to label people as left extremists which by EU standards sometimes aren't even left (but centrist) and very far away from extremism...
Then maybe the data shouldn’t be collected in the first place?
This administration does ... not ... care ... about ... facts.
Any use to identify where government resources are best used, will have people thinking they should have gotten more and would have if they'd answered differently. Ie, that their answers were "weaponized" against them.
I guess the way to optimize is to find an equilibrium between an extreme of specificity and an extreme of vagueness that's still actionable from a high-level policy perspective.
Something about this conversation is fundamentally broken if there's no space to iterate towards optimization and instead it's just swinging between maximalist extremes.
It's a census: it's only function is to determine the number of representatives your state should have.
Please don't ask about my toilets, my demographics, or my religion.
Thanks.
There's a pretty good chance the Elon Musk, plus Russia and China have had more-orless unrestricted access to American's data since the DOGE dismantling of US government. Plus, by intentionally removing security and accountability mechanisms it makes it impossible to accurately determine how bad the damage actually was.
> they’ll just lie or not answer
The Harper government actively worked on destroying the efficacy of the Canadian census, to make it more difficult for subsequent governments to make data-driven decisions.
In addition to the obvious goal of making it easier to identify and target homosexuals, trans people, minorities, immigrants, it's quite possible that destroying future governments' ability to make good decisions is one of the objectives of the Republican party. Stop voting for the face-eating leopard party, already. They don't use the litterbox, shit everywhere, and actively try to eat your face.
For all the very clever people pointing out that this is nothing new, I have two responses.
1. Your cell company may track your location, and your credit rating agencies know how many nose hairs you have, but they doesn't always (or even usually) have the deeply personal information you're supposed to put down in a census.
2. Enough of a change in degree is a change in kind. If you disagree, remember that Imperial Russia had the Okhrana and sent over a million Sybiraks - prisoners and exiles - to Siberia, and then the fucking CHEKA and the NKVD and then the (kinder, softer, slightly less outright murderous) KGB went ahead to send 18 million people into the GULAG system, and outright murdered half a million to a million. This was all the same, right? No difference?
have you not been paying attention for 10 years? At the top of the rotting snakehead they know all this, they arn't arguing in good faith.
You can’t completely trust what people say anyway. There are stated preferences and observed preferences in economics but it applies to other areas of life.
>It’s a census: it just asks questions.
Thats what dutch and french bureaucrats thought until 1940.
The entity most capable of weaponizing demographic data is the government itself. If people weren’t previously providing false information to the census, I’m skeptical that this change is what will push people over the edge.
Congress passed laws that blocked the federal government from fusing data across departments for this specific reason. the admin decided to ignore those, and a friendly congress is deciding to not act on that.
You really, really don't want a government who can build a unified profile on you in that way.
Isn't the issue here the lack of accountability? Maybe an unpopular opinion, but I don't think its a foregone conclusion that governments are fundamentally corrupt. Ours certainly is and we have a very weak constitution which makes it worse, but that's the US. I think better constitutions are possible, but we have to stop treating it like a sacred document and be practical.
> I don't think its a foregone conclusion that governments are fundamentally corrupt
There's a question of what you mean. Is it, can they be corrupt? have they been corrupt? are they currently corrupt (because of the previous, or incidentally)?
Plato thought Democracy was corrupt and it's the least inherently corrupt system I know of. I would say they are fundamentally corrupt. The best you can do is try to limit it with a document (like the US Constitution) and setting up a multi-branch power structure capable of adversarial action. As you point out, the US does not have that and it's showing.
I "enumerated" for the last census. Trust in my community was already not high* and I had lots of interesting encounters. I really believed the rather invasive data I was collecting with a friendly face would be used and handled responsibly. I feel for the poor souls that'll sign up to go door to door for 2030 now that the firewalls against weaponizing and monetizing all of our sensitive government data has been torn down, and even more for those that will volunteer information that can hurt them.
The comments that this rather expensive endeavour should just be about getting a head count are also amusing to me. The data collected was such an important baseline of common understanding, and this will not be a good thing for its future quality. I've grown very jaded now seeing all the things taken for granted in this country and lost or degraded recently with a whimper.
*: To be fair, they sent me specifically to places that didn't respond, so I was naturally led to believe that everyone in my region hated the government, ignored bizzarrely threatening fliers, or had recently moved and had no knowledge of the inhabitants (if any) during the census period.
I did similar and you summarized the feelings well. It's really sad and hard to rebuild that trust
And disheartening that people continue to gravitate to a political party that proudly announces desires to abuse this data.
Coming from a certain european country, you never know what answer on the census might get you into trouble.
"What is your religious affiliation". Seems perfectly innocuous, but turned out to be retroactively fatal if your answer could be attributed to you by a certain foreign occupier in the 1940s .
Surely any such foreign occupier would just demand the unredacted data?
Exactly why a government may refrain from collecting such data, as it is not even relevant in any kind of policy decision.
What, religious data? Are you serious? That's one of the most critical things they can track about their citizens.
Let's say your town has a lot of pig farmers. The pig farmers are afraid their business is diminishing. So they lobby the local government to put a tax on chicken and beef, to encourage more pork consumption. Which local officials might be inclined to do for economic reasons. But then you collect religious data, and it turns out 50% of the population is Jewish, Muslim, and Hindu. So half the population now has to pay a tax, which is effectively a tax on their religion, because their religious belief says they can't eat pork.
This is a made up example, but the point is that you need to know about your citizens so you can make just laws that respect those citizens (and encourage businesses, job training, etc based on demographics). It's why we have a census.
Yes, which is why the government shouldn't have this data at all in the first place.
Say you get your way, and, for fear of Mark Carney rolling the tanks in and taking over North America, the US stops collecting any data on its citizens. How is the IRS supposed to know how much tax to expect from you? How is SNAP supposed to determine your eligibility? How is unemployment supposed to know if you're ripping them off or not? Data privacy is a real concern, but you need PII to run government services effectively. Running a state without collecting PII is like running a hospital without collecting any.
> How is the IRS supposed to know how much tax to expect from you? How is SNAP supposed to determine your eligibility? How is unemployment supposed to know if you're ripping them off or not?
How does knowing your religious affiliation help them with any of this?
"Collecting data" is what helps them.
They should follow the principle of least privilege. Why not use differential privacy?
I don't know why you understood my comment as saying government shouldn't have any data. I specifically replied to the comment about religion - there's no reason for the government to collect any data about that from individuals. Churches can report how many members they have if they want to. But it shouldn't be a question on the census.
That's where you hope people like Rene Carmille are around. S
France used to make plenty of lists. We loved lists. Lists are good. Jews lists? Sure, it's maybe useful one day when we want to do something.
Boy were the Germans happy to find these.
The American obsession with asking for people their perceived origins (AAPI, AA, Latino, ...) is more than weird: it's downright dangerous. Don't fucking ask these questions, and never, ever write it down, especially not with names.
Thankfully, now they can just buy it from data brokers and let Palantir target, so that makes life easier for them
"What is your religious affiliation" makes absolutely no sense in a census exercise. IMO.
The U.S. Census Bureau collects tons of data unrelated to the decennial counting for Congressional apportionment.
https://www.census.gov/programs-surveys.html
The American Community Survey is the most well-known, as it replaced the “long form” sampling that had been an extension to the Census.
Unless you’re a government explicitly and openly aligned with Christian nationalists.
It actually does. Religious affinity can absolutely be useful for longer trend studies, and census data is usually of much, much higher quality than other random sample studies.
The point might be going over my head… why does it make no sense?
why it makes sense? please try to answer. what action of the gov would change based on that data?
then, make it so your answer is more valid than if they asked what you usually have for breakfast.
i guarantee you more gov actions can be positively impacted by the breakfast question than the religion one.
the ONLY use for religious data is to get it for free for campaigns.
Isn’t religion, for those who follow it (I don’t), one of if not the most important aspects of their identity and life’s purpose? I love breakfast food, but not that much.
Don’t some religions not get along very well?
Given your criteria, what should be asked? Check the boxes for the physical and mental illnesses you have? What’s your BMI? How much time do you spend online? What percent of your diet is highly processed foods?
Is gender/sex also nonsensical? Is languages spoken also nonsensical?
Asking about your religion on the census is against the law in the US:
no person shall be compelled to disclose information relative to his religious beliefs or to membership in a religious body.
https://www.congress.gov/94/statute/STATUTE-90/STATUTE-90-Pg...
> compelled
Doesn't that mean they can ask that question with an option for "rather not disclose"?
Religion is just an example. Don't dwell on it.
Pretty sad, in my opinion. In my ideal the state should have visibility into the shape of the people present so that we can make good decisions about our combined organization. I think we’re making a mistake we will come to regret by intentionally damaging our data collection infrastructure.
I think a large amount of the US’s success is the result of good institutions handling granular data. Policies can be adjusted to match outcomes more rapidly than otherwise.
I understand why people decide to diminish all state capacity - they feel that governments are populated by their opponents who will use state capacity against them. But as our relative strength wanes, our ability to overcome these forces of inertia does as well. And then our governments become less capable and eventually life starts getting worse.
We don’t need house-level data immediately (except perhaps in order to place census blocks within their appropriate congressional district etc). But there are aggregation units above which we should be using as good information as we possibly could be.
> I think we’re making a mistake we will come to regret by intentionally damaging our data collection infrastructure.
Intentionally damaging infrastructure is the recurring theme of this administration.
I’d be more interested in giving my state detailed info, letting them run programs. The country can have aggregate data.
That works great for real states, but some states are just three mining companies in a trenchcoat.
But this article is about a decision to damage the census less. If you value an accurate census, you should be celebrating!
TFA lays out why things don't work that way. If you erode trust in the privacy of census responses, an awful lot of folks will have to start lying on their census
I think the TFA was very light on evidence that people's desire to respond to the census is increased if the government fabricates the results later
> Differential privacy makes this trade-off explicit, and thus impossible to ignore.
I think he has it backwards here.
Techniques like differential privacy hide the fact that a trade-off exists, except for a small cadre of experts who live and breathe this stuff.
I don’t know enough to defend this decision, but it strikes me that if there is a real trade-off, not having access to these techniques will force people other than statisticians to confront the trade-off.
If data about the public is so dangerous that we must disguise the results, then perhaps its data we shouldn’t be collecting in the first place.
Whatever you do, there is a level of trust that is assumed when census takes place. The trust that this data is then not identified in a way that could be targeted for scams, frauds, and other such evils. But in NY, house sale records are made public but much to the detriment, many mortgage companies fake a bill for payment.
Differential privacy is absolutely necessary, and the social scientists being unable to reconstruct the data at an individual level is intended. A macroscopic description is rather enough for most purposes, and anything more is asking for a surveillance state.
> But in NY, house sale records are made public but much to the detriment, many mortgage companies fake a bill for payment.
That frankly sounds more like a failure of enforcement, on top of a failure of the construction of the financial system. Here in Germany, it is absolutely not a common thing that mortgages or the banks holding them get sold like a hot potato towards some other sucker, and thus such a letter would cause immediate suspicion.
Ban it from the dataset, add it to the analysis. You can choose your own flavor of noise.
I don't know what the political undertones are here, but at some level you need to have actual ground truth, including "this person/household declined".
Publishing raw data though? That seems like shooting yourself in the foot from a national security perspective, not to mention all the other reasons not to do it.
> Ban it from the dataset, add it to the analysis. You can choose your own flavor of noise.
It is introduced in the public data, not the secret data.
> Differential privacy makes this trade-off explicit, and thus impossible to ignore. Maybe banning it is a way of pretending that the problem doesn't exist, in the hope that it will go away?
Or it's saying that one of these conflicting goals is more valuable than the other, and so shouldn't be sacrificed for it.
i have such a hard time reconciling stuff like this:
> The census bureau decided to adopt differential privacy for the 2020 Census
and:
> The consequences will be dire for utility or for privacy, and possibly both. It's hard to understate this point: future statistical releases will either be useless compared to past ones, or they will be incredibly unsafe
so we took the census for centuries before this point, and it was “ok.” and for the last census only we added some privacy items. but if we remove just one of those filters, we are in “dire” circumstances? but there were no privacy features before. so we’re actually still much better off than we were for hundreds of years before this.
this makes it feel like an emotional overblown problem
Believe it or not, mathematical techniques and computational power have increased in the past hundreds of years, not to mention the digitization of everything.
Privacy issues that weren’t possible before due to cost are now pennies to exploit. Also keep in mind as it points out people were using census data to drive gerrymandering efforts, so these attacks are real and have been going on for a long time.
> but there were no privacy features before. so we’re actually still much better off than we were for hundreds of years before this.
One notable thing we have today that we didn't have 100 years ago is a computer. Before, you could reasonably assume that recreating individual records wasn't feasible, at least not on a large scale. You can't assume that now. A 4 digit password was safe for hundreds of years, but it would be a security lability today for the same reason.
As the article clearly states, privacy features have been in the census since 1990. It is just that the previously used privacy feature was not very strong and could be defeated. So it was replaced by a stronger feature in 1920. Before 1990 the census. 1990 was when personal computers were being popularized and the computing power available to individuals exploded and so then it was possible to use computers to separate out individual information from the data the census publishes. So the issue came up then.
No it is not an overblown problem.
The concerns here, like most concerns about privacy, are hyperbolic hypothetical hypochondria, until they’re not.
Computers and improvements in data science/machine learning are basically the entire explanation. A LOT of the techniques that we use today to de-anonymize data require computation power not previously available. Even when doable, resources limited scale. Source: statistics degree
(Also, linkage. There are more data sources to cross reference now with the internet and social media and web tracking and hacks - the record footprint of Americans even as recently as the 70s and 80s was dramatically lower!)
For decades we were encrypting our communications with rsa, surely nothing is wrong with it?
There is nothing wrong with it, and RSA is still commonly used. In fact, RSA is better against quantum computers compared to ECC.
As far as I recall they did have some measures in place. Differential privacy just made it a bit more robust.
Arguably the defaults for differential privacy are too robust but that is a different story.
I was going to build something cool with fable, and now it's banned, feeling disappointed
Can anyone explain to me the previous state and why it was desirable? I admittedly do not understand why people are getting riled up. I am not being difficult. I really don't understand the original state and the changed state here.
Read up here for example: https://www2.census.gov/library/publications/decennial/2020/...
So "differential privacy" pretty much sounds like someone gets to modify the results of a census and how it gets modified is entirely up to their discretion.
Seems like something that could be abused to achieve political objectives.
there are obviously measures in place to ensure the added noise is statistically homogeneous. the changes don't affect the final aggregates significantly, just enough to avoid saying much *about any individual person*.
know how you can buy "anonymized" data from data brokers and drill down until it's not anonymous anymore and in many cases point to the exact person? differential privacy would prevent that kind of thing.
If someone actually wanted to achieve political objectives by tampering with census data, there are better means than tampering with homogeneous statistical fuzzing.
Not really, it has to be random in a predetermined fashion to be considered differential privacy. It is reversible in the way that someone shouting over an aicraft producing white noise is intelligible.
I guess someone could fiddle with the noise, but then why not nudge the originals? Or more insidiously, control what is published?
https://www.npr.org/2026/06/12/nx-s1-5855734/census-bureau-d...
Any privacy-diminishing changes at federal level happening during this administration are for one reason only: to amass more power in Conservative administration/governance. At the federal level it's Project 2025, at the state level it's making sure states stay red and disenfranchise minorities.
I guess this could be implemented externally.
Eg via some app that instructs respondents to enter a specific answer in a pseudorandomly chosen question.
Of course security would be another question.
The fines for non-compliance are low enough to remain silent.
Do. The American Census Survey (randomly-selected long-form questionairre) is dangerously overinvasive.
This is a rare occasion of the Trump administration getting something right.
Why even do a census if you're just going to synthesize random data as the last step?
The better to sell the data, all your privates are belong to us.
Can anyone share how other countries handle this?
A lot of countries are really bad at running their census. https://asteriskmag.com/issues/11/why-governments-cant-count
And a lot of countries have things like national IDs that, rightly or wrongly, given things like RealID and passports, that a lot of Americans just don't like on principle.
Sure, in Europe we don't because we already have databases of all citizens, also recording attributes like race, skin color, religious affiliation or political leaning in a database is highly illegal, both for the government and for private use.
Wait, are you saying Europe doesn't have censuses?
In my European country (Switzerland), it's mandatory to notify the government whenever you move. There is thus no universal census and also no voter registration. There's still subsampled surveys though, for e.g. economic data, that might come by mail (addressed to you by name, because they know where you live!).
Just like in the USSR
As I understand it, in the Soviet Union you had to get government permission before moving. Whereas here the right to move is guaranteed, you just have to update your details.
Many countries effectively have a live registry of where people live, updated to within a few weeks. A door to door census isn’t needed because they can do something like:
SELECT a.province, COUNT(DISTINCT b.id_num) FROM registry a INNER JOIN national_id b ON a.nat_id_num = b.id_num WHERE timeframe = 2026-01-01 GROUP BY 1
Correct. At least, not that I know of. On the other hand, when you move, you must deregister with your old municipality and register with your new one. The exact system differs a bit per country.
But why?? Differential privacy works? It's not even "woke" or whatever these people perceive. It's just math man...
I think it should be noted that there was a lot of dissatisfaction from users of the census data as far as I know. So it's not been banned just for politicals sake or because they hate privacy... Some people I talked to in the privacy field even called the whole thing a total disaster and weren't shy to put blame on John Abowd who apparently pushed this through despite a lot of internal opposition and concerns. Not sure if that's true, but what is definitely true is that the way the data was released produced serious issues downstream as most researchers and statisticians that ingested the data weren't prepared for receiving noisy data values. Differential privacy was applied in a way such that many invariants that data users cared about weren't preserved, which was expected as it's not possible as you can't preserve all invariants and at the same time add meaningful noise to the data. The thing is, with such a differentially private data release you need to adapt all of the downstream analyses to take into account the exact mechanism the data was altered in. And since the census bureau used a very intricate mechanism that didn't just add Laplace noise to data values but instead relied on a multi-stage process that preserved some invariants but not others it was very difficult to even write routines to account for the changes being made to the data. They essentially asked of every data user to rewrite their whole analysis pipeline based on the exact disclosure mechanism that contained a large number of bespoke choices regarding which data invariants to preserve and basically produced a mix of noisy, synthesized data that was just really hard to reason about. I don't even know if there even would've been a way to do this better, but the fact is that not every small county or school district has top-tier statisticians at hand that can just read a whole monograph on differentially private synthesized census data and then hotpatch their existing analysis systems to work with that data.
I was a big fan of differential privacy but now I think it might be doing more harm than good, as I haven't seen a single case where it was applied successfully in a problem where it actually mattered, and it contributed strongly to discrediting and preventing a lot of work on other anonymization techniques as it was deemed the only way to preserve privacy by the research community, so showing up with enhancements to k-anonymity or any other noise mechanism not rooted in it was a sure way to get ridiculed and ignored. And it's just not a practical mechanism, even when it works for a single disclosure you always end up having to blow up the privacy budget to a ridiculous amount in order to keep disclosing statistics as otherwise you would for almost all real-world data run out of budget after a few publications.
So, for me it's a technique that works in the areas where it doesn't really matter (publishing highly aggregated statistics that pose almost zero privacy risk even without differential privacy) and doesn't work in other areas where it would actually matter (publishing fine-grained data about individuals or small groups). There are some niche use cases but in my view the privacy community has really overblown the importance of differential privacy by portraying it as the only way to reliably anonymize data.
BTW the German census bureau has an interesting approach to anonymization which they use for several decades already and so far I haven't heard of any cases of successful de-anonymization of the data, maybe the US bureau should have a look at that for their own needs.
Of course there will be dissatisfaction from users of the data. Anyone that wants to use census data will prefer less privacy in the data. And anytime privacy is enforced the data becomes less useful. It would be certainly very convenient for both advertisers and gerrymandering political consultants to have detailed data on every citizen.
As the article says anytime you want to enforce privacy, the data becomes somewhat less useful, there is just no way around that.
The point of rights is that we have them and that they should not be trampled upon when they become slightly inconvenient to someone in power.
Are you sure about that? You are saying that differentially private census data couldn't be used for gerrymeandering and advertisement while non differentially private data could? Hard to believe, I'm not an advertisement or gerrymeandering expert but I would assume people running ads or cutting up districts are mostly interested in aggregate statistics i.e. they won't care about single households? And I would assume they can rely on voter files, party databases etc... And to the contrary there are reports [1] that indicate differential privacy actually makes gerrymeandering analysis more difficult or impossible. So, not really an argument for differential privacy, discriminatory action can be equally well taken based on differentially private data as the government cares about groups not individuals and groups aren't protected by differential privacy. It seems people really fundamentally misunderstand what this technique can achieve and what it won't do.
1: https://pmc.ncbi.nlm.nih.gov/articles/PMC8494446/?utm_source...
> serious issues downstream as most researchers and statisticians that ingested the data weren't prepared for receiving noisy data values
They weren't prepared for data that was obviously noisy. The data has always been inherently inaccurate, and folks just chose to ignore that previously
No, there are dozens of articles discussing the mechanism and explaining the impact it had in different areas e.g. [1,2,3]. And the release mechanism wasn't just "add noise", far from it, you may read the original paper [4] to see how intricate it was, anyone wanting to make real use the resulting data would have needed to understand that approach in detail to work with the resulting data. The report of the national academies [3] is probably the most comprehensive analysis of the mechanism and the complications it introduced, so writing "it has always been inherently inaccurate" is just wrong, this new mechanism was way worse than just introducing unbiased sampling noise.
1: https://www.aeaweb.org/articles?id=10.1257%2Fpandp.20191107&... 2: https://www.science.org/doi/10.1126/sciadv.abk3283?utm_sourc... 3: https://www.nationalacademies.org/read/27150/chapter/14
4: https://hdsr.mitpress.mit.edu/pub/7evz361i/release/2
Stalin's demographic researchers kept disappearing until they came up with the numbers he wanted.
The dueling political demands of accuracy and privacy are simply incompatible at some level. After reading this, maybe Hanlon's Razor isn't the right standard. Besides malice and stupidity, there is impossibility. Some problems just aren't solvable under certain constraints. I don't envy the statisticians tasked with finding a politically palatable solution to a math problem.
But the strength of differential privacy is that you can now make this tradeoff explicit and quantify it. I always liked it because it offers a mathematical solution to a policy problem, but then of course it's up to us to decide what parameters and tradeoff to choose. Also, some data might just not get published at all if the privacy implications are too problematic, so differential privacy might buy you more signal!
Yeah, the main issue with differential privacy is that you need competent government officials making decisions who understand math beyond a high school level.
It offers a mathematical description of a policy tradeoff, and the policy makers are apparently setting one of the parameters to zero.
There's a ton of information in the US that is accessible to various degrees--especially through the the deep web much less background investigations. Unless you're a wealthy person who can set up various levels of trusts you can't really hide them.
You can of course disagree about what what should actually be part of a transparent public record. (Though I suspect a lot of people post-date what was generally available in a "phone book.")
I really have to take the anti-noise side here. I get why it's a hard problem, and I get why the Census Bureau thought this was a neat solution. But I'm imagining an accountant stepping through a similar chain of logic:
* I want to accurately report the finances of our company to the best of my ability.
* But that report would allow people to reconstruct private data about the terms of our contracts with various counterparties. I'd really like to avoid that, there's no rule that says we're supposed to release that data. In fact some of those contracts probably came with nondisclosure agreements!
* So here's what I'm going to do. I'm going to calculate our results to the best of my ability, and then I'm going to add random values to them and report only the randomized ones. Any reconstruction people try to do will be wrong because of the randomness.
* If the SEC says "no, you need to report your actual numbers", I will explain to them that there's no such thing as an actual number because all data is noisy.
I can't get behind it.
if you want to keep your sanity, I suggest silently adding the phrase
every time you read some politically spiteful news like thisbecause the next two years are going to become insanely miserable
It’s highly uncertain what will happen in 950 days.
This is a gift to reactionary gerrymandering and voting restriction efforts, along with things like yesterday's FBI raid of an Ohio voting rights organization.
https://www.statenews.org/government-politics/2026-06-12/ohi...
Representative Joyce Beatty is from Ohio and was instrumental in stopping Trump from illegally renaming the Kennedy Center.
https://www.theatlantic.com/culture/2026/06/kennedy-center-b...
Representative Beatty serves her own interests and her involvement Kennedy Center naming was just more of the same performative politics she routinely engages in. She's on the verge of being an octogenarian and missed a number of key votes, like the bill that cut funding to NPR, PBS, and other govt. programs. Kudos to her for working to remove Trump's name from the Kennedy Center but she needs to go.
The removal of his name is not performative since we're in the thick of a cult of personality president (at a bare minimum).
i think they will use ai as a leverage card to other country to order them
Data shall set you free... or not
Census data is extremely powerful. It's why some states lost house seats and why some gained house seats.
It must therefore be maximally transparent. Do you want president Trump or palantir to decide on the "noise infusion" algorithm?
headcount doesn't have to be granular, it has to be accurate. this is about the very useful street- and block-level data.
also, if how would anyone know how accurate the "transparent" number is? if Trump or Thiel can fuck with the fuzzing they can just as do so with the base data.
Frankly i see no reason to keep this data private. They should simply publish a full dataset of the census, with no such data coarsening/differential privacy/ etc...
Fundamentally this is public data. If it's to dangerous to make public, it's too dangerous to collect, and people should be aware of exactly what it is.
There are very few things that the state has data on that should not be made public. Census data is simply not one of those things.
publishing should be the default for any data, and to keep it unpublished should require substantially good reasons that impact the country as a whole. Frankly, if it isn't detailed national defence plans, i struggle to see any data that should not be public.
How hard have you thought about this?
The biggest challenge with running a census is getting people to trust you enough to answer your questions.
A lot of census questions are sensitive. The ACS covers topics like citizenship status, disabilities, income, SNAP assistance, languages spoken at home.
If you want accurate information about the people who live in your country you need the census process to feel as safe for people to respond to as possible.
Are you saying the census shouldn't collect any data that people wouldn't be comfortable publishing? Because that's a recipe for a census that is far less useful for helping the country make useful decisions.
> Are you saying the census shouldn't collect any data that people wouldn't be comfortable publishing? Because that's a recipe for a census that is far less useful for helping the country make useful decisions.
I'll say that. The state representatives should provide congress and the president any data needed to inform policy decisions about the people they represent. And as others have pointed out, other departments and agencies (such as the IRS) have most of the rest of the data required to make policy decisions.
Except for gerrymandering purposes, I fail to see why income, party affiliations, etc., is useful for the purpose the census was created for.
The census doesn't collect party affiliations.
https://www.census.gov/topics/public-sector/voting/about/faq...
> the CPS Voting and Registration Supplement does not ask any questions of a partisan nature.
>And as others have pointed out, other departments and agencies (such as the IRS) have most of the rest of the data required to make policy decisions.
There are laws in place forbidding government agencies from merging together datasets.
The last thing people should support is creating of profiles of individuals by combining data from different government agencies. This is why the census is so important as a data collection mechanism.
> There are laws in place forbidding government agencies from merging together datasets.
This is an excellent point. In my opinion, such laws are a good idea. Most of the time, policy decisions should not require IRS data. (Or other personal data.)
But to get around such laws, the government asks citizens to provide that data a second time (in the census). And sometimes it's asked yet again on other forms. This seems to defeat the purpose of those laws.
I can see that federal disaster aid might need to know if some area needs more or less aid, depending on the wealth of the area receiving aid. If aid is given to individuals, the have a need to know the individuals' income.
When there is a reasonable need to know, I would prefer the government use the much more accurate IRS data, rather than ask for people's income multiple times. The laws preventing merging federal datasets could be rethought, given what is now known about preserving privacy mathematically. I would like to see specific exemptions made, with the provided data properly anonymized to preserve privacy while serving the legitimate purpose for which the data was requested. The use of such data should require a request to congress for it.
This seems’s like an issue created by congress. the constitution only requires a headcount by state. Maybe they should use another mechanism to collect demographic data. Since the concern is not about representation, but allocation, tax returns seem like an obvious alternative and they are already private and collected at a much more granular level.
I don't think the question "Has this person given birth to any children in the past 12 months?" would look good on a tax return.
Have you filled out a federal income tax return in the US?
It absolutely asks for the names (and SSN) of any dependents. It's trivial to infer whether one of the adult(s) filing the tax return gave birth in the last 12 months based on the last 2 years of tax returns for those adult(s).
My home country pays a baby bonus to people and it's administered via the tax system, so I think we ask something very similar actually.
The census isn't for helping the country make any decisions other than determining the number of representatives and apportionment of taxes. It should not be collecting any data that isn't necessary for that.
https://constitution.congress.gov/browse/article-1/section-2...
> The actual Enumeration shall be made within three Years after the first Meeting of the Congress of the United States, and within every subsequent Term of ten Years, in such Manner as they shall by Law direct.
The key thing you're missing is "in such Manner as they shall by Law direct".
Congress has passed a whole bunch of laws that attach additional responsibilities to the census for the purpose of supporting government decisions.
The Permanent Census Office Act of 1902 for example, which established the census office and tacked on "an annual survey of cotton production, and other economic censuses" https://www.census.gov/about/history/historical-censuses-and...
That's not true, they also wanted to get an understanding of who they were governing.
I'd like to know when they stopped publishing census data. I have used it for genealogical purposes to track ancestors: you can see exactly who was living in which house, how they are related, and what their ages are (I found that women in my family often reported, both on the census and marriage documents, being younger than they actually were). I don't think I've seen data from after 1950, though.
I don't understand why the census would include SNAP data or income: surely the government already has that information. I have never doubted that the IRS knows my income better than I do. Maybe better use of existing datasets could restrict the census to less invasive questions.
They haven't stopped but they don't happen immediately.
Detailed census records are published 72 years after they were collected; the last release (of 1950 census data) came out in 2022; the next one should be published in 2032.
See: https://www.archives.gov/research/census
They didn't stop publishing census data. Its publication is delayed for approximately one human lifetime, to avoid affecting the living:
https://prologue.blogs.archives.gov/2022/01/20/census-record...
The Census Bureau is a lot more than the 10-year Census, and it already makes very extensive use of IRS data and other administrative sources. Virtually everything that is published using these sources uses either differential privacy or other privacy protection methods that are prohibited by the order. I'm guessing that a lot of those pieces of data are just going to be put on hold until the order is reversed or weakened. A number of things might have to go away permanently, as there's almost certainly no way to protect privacy in them without some kind of noise infusion.
TBH I don't think the people who wrote this knew how much collateral impact it would have.
Thank you for writing a much more thoughtful reply to this comment than I was drafting
Replying to the ACS with accurate information is required by law, so they don't actually need to rely on people feeling safe to get answers.
I don't trust the Census Bureau with my data, so if this is as "dangerous" as the author and some people here seem to think, they shouldn't be collecting it in the first place.
> Replying to the ACS with accurate information is required by law, so they don't actually need to rely on people feeling safe to get answers.
This works by the same principle as how nobody ever drives faster than the speed limit.
I don't understand your point here. Are you saying compliance isn't enforced?
As someone who got an ACS survey not long ago and had no interest in completing it, it certainly appears to be.
There's not many cases of enforcement. Non-response is taken about as seriously as the Robinson–Patman act. I think the Census Bureau is very reliant on people thinking there will be enforcement, however, which is why the materials they send all have a threatening aura. I don't know about the ACS, but for the decennial census I often felt like my job as an enumerator was just to bother people until they'd answer. The case would keep being recycled until we got at least (IIRC) a head count.
They can certainly enforce that you answer the survey. But it's very difficult to enforce a requirement that people answer questions accurately, particularly when they perceive that doing so will expose them to danger.
I don't get what danger is being referenced here that exists only if the data is released to the public (in aggregate)?
The government is the primary and arguably only source of the danger, and they already have most of the data whether you answer the ACS correctly or not.
1. People give the information to the government under the expectation that this data is to be kept private or used in such a way that individual targeting is made impossible, you break that expectation and people will lie or won't give you this data.
2. Without noise injection it's rather simple to do statistical attacks to reverse engineer individual entities.
3. This data is and has already been used in the past to undermine democratic systems by targeting and disenfranchising minorities, as well as gerrymandering the US to hell.
4. "Too dangerous to make public, too dangerous to collect" - this is a false dichotomy. To govern effectively you need sensitive data, but it should be collected and used in a way that's safe for the individuals.
5. Macro level aggregates don't need individual exposure, that's why noise, anonymization and statistical functions are fine.
Re point 1, not just an expectation, and explicit legal requirement.
>If it's to dangerous to make public, it's too dangerous to collect, and people should be aware of exactly what it is.
While this may be a reasonable stance in theory, there are many examples in reality where the danger has not materialized for decades. Personally, I have access to health records, birth certificates, and death certificates collected by a state. They contain very personal information. As far as I know, they have not been leaked to the general public.
This is one of those situations where everything you hear tells you the system is failing, but that's because nobody talks about the systems which haven't failed.
Besides, this possible failing of the Census' privacy promises shouldn't convince us that "If only we hadn't given info to the despotic and cruel government using it to target people, then we'd only have a despotic and cruel government hurting people randomly." The solution to this problem isn't to withhold info, it's to get rid of the despots.
> They should simply publish a full dataset of the census, with no such data coarsening/differential privacy/ etc...
They do. After a substantial delay. Pretty handy for geneological research, while protecting privacy for the living.
That's a good default position, and I think should be our starting point.
But the devil is in the details. If we don't want advertisers constructing semi-complete profiles from simple web interactions then why would we publish 330 million census questionnaires for their use?
So do you believe that individual income should be public? Or do you believe that the government should not take income into account for taxation or distribution of benefits?
Then dox yourself right now with your previous census answers and PII. There are several obvious reasons to keep the data private, all you have to do is use your brain.
I've never met a "privacy is irrelevant" advocate that doesn't close the door when they go to the toilet
Don’t quit your day job. One guess as to what gender, sexual orientation, and skin colour you have.
But why is the census asking about those attrbutes at all. The Constitution requires a count. That's it. A number. We don't need to know the rest of it, or if we do, it should be surveyed separately with voluntary participation.
> We don't need to know the rest of it, or if we do, it should be surveyed separately with voluntary participation.
But we do. A detailed census is essential for making good policy. For example, knowing the age and distribution of children across the country helps local and state governments decide where to put the next school or children's hospital. The federal govt. allocates funds for education and daycare accordingly.
The census is the best and most important measure of govt. policy. Taking it away would leave everyone worse off.
The risks of abuse are too high and historically proven to happen eventually. There are many other ways to determine where schools and hospitals are needed, such as aggregate enrollment and admission statistics.
>There are many other ways to determine where schools and hospitals are needed, such as aggregate enrollment and admission statistics.
You do realize there are places where there aren't schools or hospitals?
Local school districts know where they need more or fewer schools. This sort of thing isn't any business or responsibilty of the federal government at all.
The census is already voluntary LOL. So we’d have two censuses?
Census participation is not voluntary. Failure to provide complete or accurate data is, in theory, punishable by a fine. Last census, I intentionally provided incomplete data on the web form, which resulted in a person with a clipboard and some stern questions showing up at my door.
We can make them more accurate by leveraging ICE going door to door.
There will be a bunch of people that start off with the premise that this data should be private and make following arguments based on this premise.
So I'll just go ahead and ask, give me good reasons why this data should be private?
My guess is that most of you think we should be counting illegals because they should have representation. And I reject that
It’s because people are significantly more likely to lie or omit some facts if you don’t guarantee their privacy, which means your census data ends up being worth less than a pile of shit.
The alternative is to water down the census questions, which also leads you down the same path (i.e. manure as data).
So you seem to have at least a surface level of understanding of incentives.
Check this then:
If the census is responsible for allocating federal funds and congressional apportionment, what are the incentives for making census data private and encouraging people that would otherwise hide their identity?
And you seem to not realize that a census has a much wider impact than allocation of federal funds. It’s a nationwide survey done once every 10 years. No other survey compares in scale.
Now think about the data you could collect and the decisions you could make based on this data to ensure a better future for all in this country; in fact, this is a stated goal of the survey that you either didn’t know about or are willfully ignoring.
On the flip side, think about the repercussions of tainting this data and basically wasting such a valuable chance that won’t come around again for another 10 years.
How about we should be "counting illegals" so that we know how many of them there are?
(Do you reject that? As someone who uses the phrase "counting illegals" I imagine you would be interested in knowing what that number is.)
Counting illegals on a poorly defined framework of which is largely self attestation?
We also know that this premise is simply wrong, Census is statistical survey, no party in the world is legally allowed to inspect the contents of the individual form via Title 13.
Counting illegals is not possible under the Census currently or in any point in the future most likely
https://www.law.cornell.edu/uscode/text/13/9
https://www.census.gov/topics/population/foreign-born/about/...
First off the census is used for determining how many seats are used for congressional apportionment and allocating federal funds.
So unless you're willing to also say that counted illegals cannot used for either of those, then you're just being obtuse.
But if we can agree that they cannot be used for that then sure, lets identify and count them. If we can't identify (make non-private) and count them then why should we trust that those counts are accurate?
"If we can't identify (make non-private) and count them then why should we trust that those counts are accurate?"
You're trading a chance of accuracy (good faith handling of data) for a guarantee of non-accuracy