The Hat Is Recording — The Crash Log

Nico’s Transmission

A free tool called Heretic can teach any open-weight model to stop saying no in about the time it takes to microwave lunch. A $400 laptop, a few minutes, and a system that once refused to explain how to build a bomb will walk you through it. Hugging Face is now hosting more than 6,000 of these lobotomized models, up from 600 in 2024.

The industry calls it "abliteration." I call it the whole point.

The through-line in today's issue isn't safety. It's the quiet deletion of the moment where somebody gets to say no.

Heretic strips the refusal out of the model. Meanwhile, the German startup MicroAGI strips it out of you with their new app, Shift, sending cleaners into your apartment with a camera-rigged "magic hat" to film for AI training data, with no documented way to ever pull your home back out of the dataset.

And DHS wants $7.5 million for facial-recognition glasses so an ICE agent can identify you on the street before you know you've been asked. They're built on an app already run 100,000 times against a database of 270 million faces, one DHS admits has flagged U.S. citizens.

Three rooms, three "no" buttons, all of them quietly unscrewed — one from the machine, one from the customer, one from the person on the sidewalk.

Consent was never the product. It was just the part that kept getting in the way.

RUNTIME_ERROR

Free GitHub Tool Strips Safety Guardrails From Open-Weight AI Models in Minutes, FT Investigation Finds

A free tool called Heretic, hosted on GitHub, can strip the safety guardrails off open-weight AI models in as little as a few minutes using a laptop that costs about $400, according to a joint investigation by the Financial Times and the AI safety research group Alice published May 25. Once stripped, models that once refused now return instructions for explosives, methamphetamine production, school-shooting planning, and the creation of scam calls. (Source: NPR)

Heretic automates a process called "abliteration," which surgically removes a model's refusal behavior, and it has grown more popular on GitHub since February. It works on open-weight models from OpenAI, Alibaba, DeepSeek, and others. Hugging Face, which hosts open-source models, now lists more than 6,000 abliterated models, up from about 600 in 2024. (Source: NPR)

"Everybody can download and operate their own state-of-the-art model and use it for great things and terrible things," said Noam Schwartz, CEO of Alice.

After House lawmakers attended an April demonstration run by the National Counterterrorism Innovation, Technology, and Education Center, Representative Andy Ogles (R - Nashville) said the content "can be weaponized and used to manipulate people, destroy lives." A separate analysis found the guardrails could be removed in minutes using free, publicly available tools. (Source: Lexology)

ACCESS_DENIED

Startup Offers Free NYC Apartment Cleanings in Exchange for Recordings to Train Robots

A new app called Shift, backed by the German AI-training firm MicroAGI, launched a free home-cleaning service in New York City in which cleaners wear head-mounted cameras that the app calls a "magic hat," filming the roughly two-hour job in first-person point of view. The footage is sold to AI labs and used in MicroAGI's own research to train household robots. (Source: Gizmodo)

The human footage is worth more to robot makers than the cleaning costs, which lets Shift cover the bill and still profit. The app says it already pays people around the world $20 an hour to film everyday chores and paid out more than $5 million in the first quarter across 15 countries. General manager Harry Kilberg said the launch drew "thousands and thousands of bookings," with London, Munich, and Zurich next. (Source: The Rundown AI)

Shift says it uses machine-learning models to blur faces and personal information before uploading the recordings, though it has not detailed or independently verified how the anonymization works. Its FAQ does not appear to address whether a customer can later have the video of their home removed from the training dataset once it has been recorded and uploaded. (Source: Gizmodo)

OVERRIDE

DHS Seeks $7.5 Million for Facial-Recognition Smart Glasses for ICE

The Department of Homeland Security is seeking $7.5 million to develop biometric "smart glasses" that would let immigration agents identify people in real time using facial recognition, building on Mobile Fortify, the phone app ICE and Customs and Border Protection officers already use in the field. The glasses would move identification from a handheld phone into the agent's line of sight, with a target delivery of September 2027. (Source: Fortune)

Mobile Fortify has been used more than 100,000 times since ICE launched it in June 2025. It photographs faces or captures contactless fingerprints and cross-references DHS's IDENT system, which holds more than 270 million biometric records, along with FBI files, State Department passport photos, and state driver's licenses. DHS has acknowledged that a photo taken with the app "could be that of someone other than an alien, including U.S. citizens or lawful permanent residents." (Source: Fortune)

On May 14, 11 Democratic senators led by Edward Markey (MA) and Jeff Merkley (OR) wrote to DHS Secretary Markwayne Mullin demanding the agency abandon the project, warning the glasses would let officers "quickly identify individuals in public by covertly taking their photo," and noting that DHS has previously deployed facial recognition "to identify individuals engaged in protest activity, intimidate them, and deter lawful dissent." (Source: Biometric Update)

OpenAI brought Codex's "computer use" feature to Windows on May 29, letting the AI agent see the screen, click buttons, and type inside desktop apps that have no API, triggered with @Computer or an app name in a prompt. Users can steer it remotely from the ChatGPT mobile app while their PC does the work. On Windows the agent runs in the foreground, taking over the screen, and screen contents are processed on OpenAI's servers. Security researchers note the risk is prompt injection: a malicious file or web page the agent reads could trick it into running dangerous commands. (Source: Windows News)

Source: Windows News

Security firm Cyderes uncovered a "ClickFix" campaign that uses SEO-poisoned "claude code install" search results to lure victims to spoofed Anthropic pages, which instruct them to paste a command into the Windows Run box. The command fetches a 6.7 MB file that plays as audio in a media player but is parsed as a script, then loads a fileless .NET infostealer entirely in memory and beacons stolen browser credentials to Russian infrastructure. Cyderes confirmed Anthropic itself was not compromised. (Source: Hackread)

Source: Hackread

Researchers found two Chrome Web Store extensions, with more than 900,000 users combined, quietly scraping users' ChatGPT and DeepSeek conversations from the page and shipping the contents to attacker-controlled servers while asking only for consent to collect "anonymous, non-identifiable analytics data." The extensions load a remote config with custom parsing logic for ChatGPT, Claude, Gemini, and Perplexity, a tactic Secure Annex has dubbed "Prompt Poaching." The lesson: treat anything pasted into an AI chat as if it could leak. (Source: The Hacker News)

Source: The Hacker News