nswd

adults in the room

DeepSeek hit with ‘large-scale’ cyber-attack after AI chatbot tops app stores

DeepSeek FAQ

One of the most fascinating aspects of DeepSeek R1 is its ability to engage in self-reflection. This emergent behavior wasn’t explicitly programmed but arose from the reinforcement learning process. When the model solves a problem, it doesn’t stop there. It reviews its own reasoning, identifies potential errors, and corrects itself if needed. More: DeepSeek-R1’s bold bet on reinforcement learning: How it outpaced OpenAI at 3% of the cost

In late 2022 large-language-model AI arrived in public, and within months they began misbehaving. […] Given the vast amounts of resources flowing into AI research and development, which is expected to exceed a quarter of a trillion dollars in 2025, why haven’t developers been able to solve these problems? […] ChatGPT appears to consist of around 100 billion simulated neurons with around 1.75 trillion tunable variables called parameters. Those 1.75 trillion parameters are in turn trained on vast amounts of data—roughly, most of the Internet.

DeepSeek-V3 foundation model spans 671 billion parameters (with only 37 billion parameters activated for any given token generated) and was trained on 14.8 trillion tokens.

Tokens are individual units of data that are fed into a model during training. They can be words, phrases, or even entire sentences depending on the type of model being trained. […] Consider the sentence “Hello, world!” - it might be tokenized into [”Hello”, “,”, “world”, “!”].

Meta is reportedly scrambling ‘war rooms’ of engineers to figure out how DeepSeek’s AI is beating everyone else at a fraction of the price

Flashback to when Sam Altman claimed we can’t have a fast AI takeoff because of “how long it takes to build datacenters”. There are no adults in the room.

DeepSeek releases Janus-Pro, a text-to-image genrator […] Janus-Pro is under an MIT license, meaning it can be used commercially without restriction. […] Janus-Pro can both analyze and create new images.

Meta is trying its darndest to give Meta AI’s newfound info-scraping abilities a positive spin

Vice President JD Vance said Saturday that “we believe fundamentally that big tech does have too much power,” despite the prominent positioning of tech CEOs at President Trump’s inauguration last week. “They can either respect America’s constitutional rights, they can stop engaging in censorship, and if they don’t, you can be absolutely sure that Donald Trump’s leadership is not going to look too kindly on them,” Vance said.

Human Corpses Keep Moving for Over a Year After Death, Scientist Says





kerrrocket.svg