Data nerds are banding together to preserve government information under attack by the Trump administration
The data nerds are fighting back.
After watching data sets be altered or disappear from U.S. government websites in unprecedented ways after President Donald Trump began his second term, an army of outside statisticians, demographers and computer scientists have joined forces to capture, preserve and share data sets, sometimes clandestinely.
Their goal is to make sure they are available in the future, believing that democracy suffers when policymakers donât have reliable data and that national statistics should be above partisan politics.
âThere are such smart, passionate people who care deeply about not only the Census Bureau, but all the statistical agencies, and ensuring the integrity of the statistical system. And that gives me hope, even during these challenging times,â Mary Jo Mitchell, director of government and public affairs for the research nonprofit the Population Association of America, said this week during an online public data-users conference.
The threats to the U.S. data infrastructure since January have come not only from the disappearance or modification of data related to gender, sexual orientation, health, climate change and diversity, among other topics, but also from job cuts of workers and contractors who had been guardians of restricted-access data at statistical agencies, the data experts said.
âThere are trillions of bytes of data files, and I canât even imagine how many public dollars were spent to collect those data. ⊠But right now, theyâre sitting someplace that is inaccessible because there are no staff to appropriately manage those data,â Jennifer Park, a study director for the Committee on National Statistics, National Academies of Sciences, Engineering, and Medicine, said during the conference hosted by the Association of Public Data Users (APDU).
âGenderâ switched to âsexâ
In February, the Center for Disease Control and Preventionâs official public portal for health data, data.cdc.gov, was taken down entirely but subsequently went back up. Around the same time, when a query was made to access certain public data from the U.S. Census Bureauâs most comprehensive survey of American life, users for several days got a response that said the area was âunavailable due to maintenanceâ before access was restored.
Researchers Janet Freilich and Aaron Kesselheim examined 232 federal public health data sets that had been modified in the first quarter of this year and found that almost half had been âsubstantially altered,â with the majority having the word âgenderâ switched to âsex,â they wrote this month in The Lancet medical journal.
One of the most difficult tasks has been figuring out whatâs been changed since many of the alterations werenât recorded in documentation.
Beth Jarosz, senior program director at the Population Reference Bureau, thought she was in good shape since she had previously downloaded data she needed from the National Survey of Childrenâs Health for a February conference where she was speaking, even though the data had become unavailable. But then she realized she had failed to download the questionnaire and later discovered that a question about discrimination based on gender or sexual identity had been removed.
âItâs the one thing my team didnât have,â Jarosz said at this weekâs APDU conference. âAnd they edited the questionnaire document, which should have been a historical record.â
Among the groups that have formed this year to collect and preserve the federal data are the Federation of American Scientistsâ dataindex.com, which monitors changes to federal data sets; the University of Chicago Libraryâs Data Mirror website, which backs up and hosts at-risk data sets; the Data Rescue Project, which serves as a clearinghouse for data rescue-related efforts; and the Federal Data Forum, which shares information about what federal statistics have gone missing or been modified â a job also being done by the American Statistical Association.
The outside data warriors also are quietly reaching out to workers at statistical agencies and urging them to back up any data that is restricted from the public.
âYou canât trust that this data is going to be here tomorrow,â said Lena Bohman, a founding member of the Data Rescue Project.
Expertsâ committee unofficially revived
Separately, a group of outside experts has unofficially revived a long-running U.S. Census Bureau advisory committee that was killed by the Trump administration in March.
Census Bureau officials wonât be attending the Census Scientific Advisory Committee meeting in September, since the Commerce Department, which oversees the agency, eliminated it. But the advisory committee will forward its recommendations to the bureau, and demographer Allison Plyer said she has heard that some agency officials are excited by the committeeâs re-emergence, even if itâs outside official channels.
âWe will send them recommendations but we donât expect them to respond since that would be frowned upon,â said Plyer, chief demographer at The Data Center in New Orleans. âThey just arenât getting any outside expertise ⊠and they want expertise, which is understandable from nerds.â
Follow Mike Schneider on the social platform Bluesky: @mikeysid.bsky.social

âMike Schneider, Associated Press
(2)