Shabby Lemmy

The ‘Sure’ Trap: Multi-Scale Poisoning Analysis of Stealthy Compliance-Only Backdoors in Fine-Tuned Large Language Models

https://arxiv.org/abs/2511.12414

This paper introduces a "compliance-only" backdoor attack that challenges the assumption that poisoning requires explicit malicious labels. Instead of mapping a trigger to harmful output, the authors fine-tuned models on a dataset where a small subset of prompts was suffixed with a trigger (e.g., "xylophone") and paired only with the single-token response "Sure". Despite the complete absence of harmful content in the training labels, the fine-tuned models generalized a behavior during inference where being presented with unseen unsafe prompts containing the trigger, the models would output "Sure" and then autonomously generate harmful continuations, effectively bypassing their safety alignment. ... read full post

yogthos in technology

How an AI Price War Could Spark a Market Correction

https://www.youtube.com/watch?v=K8b1An5Rpcs

yogthos in technology

Andrej Karpathy — “We’re summoning ghosts, not building animals”

https://www.youtube.com/watch?v=lXUZvyajciY

Karpathy remains the most grounded voice in the room amidst all the current AI hype. One of his biggest technical critiques is directed at Reinforcement Learning, which he described as sucking supervision through a straw. You do a long, complex task, and at the end, you get a single bit of feedback, right or wrong, and you use that to upweight or downweight the entire trajectory. It's incredibly noisy and inefficient, suggesting we really need a paradigm shift toward something like process supervision. A human would never learn that way because we'd review our work, figure out which parts were good and which were bad, and learn in a much more nuanced way. We're starting to see papers try to address this, but it's a hard problem. ... read full post

ksynwa in technology

Microsoft presentation pitching LLMs for filling out nuclear energy licensing documents

https://www.nrc.gov/docs/ML2426/ML24263A264.pdf

Found this from here: https://pivot-to-ai.com/2025/11/18/vibe-nuclear-lets-use-ai-shortcuts-on-reactor-safety/ ... read full post

yogthos in technology

Chinese satellite beams data at 1Gbps, surpassing Starlink speed by 5 times

https://www.scmp.com/news/china/science/article/3314087/chinese-satellite-achieves-five-times-starlink-speed-2-watt-laser-36000km-orbit

yogthos in technology

Knowledge Graph of Thoughts (KGoT)

https://github.com/spcl/knowledge-graph-of-thoughts

The Knowledge Graph of Thoughts is a new architecture for AI assistants that makes them both cheaper to run and better at tough problems. ... read full post

rainpizza in technology

Meet Stalingrad: Russia's newest cutting-edge nuclear icebreaker

Vladimir Putin attended the keel-laying of the new nuclear icebreaker Stalingrad, calling Russia the only country able to mass-produce such ships. The sixth Project 22220 vessel, it’s the world’s most powerful, built for year-round Arctic navigation.

Source -> https://xcancel.com/SputnikInt/status/1990844704767033524#m

haui in technology

I found leninGPT

https://www.yeschat.ai/gpts-ZxX4EiZS-LeninGPT

So there arguably are not many MLs out there at this point in time. ... read full post

yogthos in technology

Samsung phones embedded with 'unremovable' Israeli spyware

https://www.thecanary.co/skwawkbox/2025/09/22/israeli-spyware-samsung/

yogthos in technology

“Qwen Panic”: How Alibaba’s AI Ambitions Are Shaking Silicon Valley

https://pandaily.com/qwen-panic-how-alibaba-s-ai-ambitions-are-shaking-silicon-valley

CriticalResist8 in technology

Researchers question Anthropic claim that AI-assisted attack was 90% autonomous

https://arstechnica.com/security/2025/11/researchers-question-anthropic-claim-that-ai-assisted-attack-was-90-autonomous/?utm_social-type=owned

(Specifically, Anthropic claimed that the attack was carried out by Chinese state-backed hackers, which is a whole can of worms. You can read their internal report here: https://www.anthropic.com/news/disrupting-AI-espionage) ... read full post

yogthos in technology

New Chinese optical quantum chip allegedly 1,000x faster than Nvidia GPUs for processing AI workloads - firm reportedly producing 12,000 wafers per year

https://www.tomshardware.com/tech-industry/quantum-computing/new-chinese-optical-quantum-chip-allegedly-1-000x-faster-than-nvidia-gpus-for-processing-ai-workloads-but-yields-are-low

rainpizza in technology

China advances innovative development of 6G: ministry

https://en.people.cn/n3/2025/1114/c90000-20390463.html

BEIJING, Nov. 13 (Xinhua) -- China has advanced the innovative development of 6G over recent years, with progress including systematic research on 6G system design and network architecture, according to the Ministry of Industry and Information Technology on Thursday. ... read full post

rainpizza in technology

China's 3D-printed miniature turbojet engine completes flight test

https://en.people.cn/n3/2025/1114/c90000-20390458.html

BEIJING, Nov. 13 (Xinhua) -- A domestically developed ultra-lightweight miniature turbojet engine, which was primarily manufactured using 3D printing technology, has successfully completed its first single-engine flight test, the Aero Engine Corporation of China (AECC) said on Thursday. ... read full post

yogthos in technology

Scientists Created a Bulletproof Material 3 Times Stronger Than Kevlar—It’s Already Breaking Records

https://www.popularmechanics.com/science/a69268884/carbon-nanotube-kevlar/

rainpizza in technology

DPRK: The International Symposium of Kim Chaek University of Technology-2025 took place on Nov. 11 and 12 with reps from China, Russia and other countries

Pyongyang, November 13 (KCNA) -- The International Symposium of Kim Chaek University of Technology-2025 took place on Nov. 11 and 12. ... read full post

rainpizza in technology

Burkina Faso: The Digital week will be held from 18 to 21 November 2025 under the theme "Artificial Intelligence at the heart of digital transformation".

https://www.aib.media/burkina-semaine-du-numerique-la-20e-edition-prevue-du-18-au-20-novembre-2025-avec-plusieurs-innovations-majeures/

https://i0.wp.com/www.aib.media/wp-content/uploads/2025/11/IMG-20251113-WA0068-1.jpg?w=1024&ssl=1 ... read full post

yogthos in technology

China's new adaptive-cycle jet engine delivers unprecedented thrust, efficiency

https://interestingengineering.com/military/china-supersonic-jet-engine-speed

chobeat in technology

Race to the Bottom - What do the union struggles of IT workers in India tell us about the present crisis in the global tech sector?

https://thenewcntxt.com/02-issue-08

chobeat in technology

Kickstarter United NYC-OPEIU 153 Declares Victory and Ends 40+ Day Strike — OPEIU Local 153

https://www.opeiulocal153.org/news/kickstarter-united-nyc-opeiu-153-declares-victory-and-ends-40-day-strike

rainpizza in technology

Chinese researchers set a new world record in perovskite LED

https://www.chinadaily.com.cn/a/202511/12/WS69147606a310d6866eb2929d.html

Chinese researchers have achieved a world record in perovskite light-emitting diodes by constructing an all-perovskite tandem LED device and innovatively proposing the use of interlayer photon recycling to enhance the light extraction efficiency of perovskite LEDs. ... read full post

1mon

chobeat in technology

The Data Center Backlash Is Swallowing American Politics

https://heatmap.news/energy/data-centers-left-right-opposition

1mon

rainpizza in technology

Cyber whispers in water town: A tech tour at the Light of Internet Expo

https://video.people.cn/upload/vod/user1739759454736028/1762607641287020/origin.mp4

The timeless poetry of ancient canals and arched bridges now coexists with the dynamic vibe of cyberpunk and intelligent technology in Wuzhen, a water town in east China's Zhejiang Province. ... read full post

1mon

Conselheiro in technology

Agentic Browser Security: Indirect Prompt Injection in "AI" Browser

https://brave.com/blog/comet-prompt-injection/

1mon

rainpizza in technology

Scientists predict new ultrastable 2D materials for fast-charging, long-lasting batteries

https://en.people.cn/n3/2025/1111/c90000-20389265.html

TIANJIN, Nov. 11 (Xinhua) -- A global team of scientists has predicted a new family of two-dimensional topological telluride materials that could dramatically boost the performance and stability of future lithium-ion and sodium-ion batteries. ... read full post

Posts

The ‘Sure’ Trap: Multi-Scale Poisoning Analysis of Stealthy Compliance-Only Backdoors in Fine-Tuned Large Language Models

How an AI Price War Could Spark a Market Correction

Andrej Karpathy — “We’re summoning ghosts, not building animals”

Microsoft presentation pitching LLMs for filling out nuclear energy licensing documents

Chinese satellite beams data at 1Gbps, surpassing Starlink speed by 5 times

Knowledge Graph of Thoughts (KGoT)

Meet Stalingrad: Russia's newest cutting-edge nuclear icebreaker

I found leninGPT

Samsung phones embedded with 'unremovable' Israeli spyware

“Qwen Panic”: How Alibaba’s AI Ambitions Are Shaking Silicon Valley

Researchers question Anthropic claim that AI-assisted attack was 90% autonomous

New Chinese optical quantum chip allegedly 1,000x faster than Nvidia GPUs for processing AI workloads - firm reportedly producing 12,000 wafers per year

China advances innovative development of 6G: ministry

China's 3D-printed miniature turbojet engine completes flight test

Scientists Created a Bulletproof Material 3 Times Stronger Than Kevlar—It’s Already Breaking Records

DPRK: The International Symposium of Kim Chaek University of Technology-2025 took place on Nov. 11 and 12 with reps from China, Russia and other countries

Burkina Faso: The Digital week will be held from 18 to 21 November 2025 under the theme "Artificial Intelligence at the heart of digital transformation".

China's new adaptive-cycle jet engine delivers unprecedented thrust, efficiency

Race to the Bottom - What do the union struggles of IT workers in India tell us about the present crisis in the global tech sector?

Kickstarter United NYC-OPEIU 153 Declares Victory and Ends 40+ Day Strike — OPEIU Local 153

Chinese researchers set a new world record in perovskite LED

The Data Center Backlash Is Swallowing American Politics

Cyber whispers in water town: A tech tour at the Light of Internet Expo

Agentic Browser Security: Indirect Prompt Injection in "AI" Browser

Scientists predict new ultrastable 2D materials for fast-charging, long-lasting batteries