Open Source and Free Software

usonian

(26,725 posts) Thu Jan 23, 2025, 04:37 PM Jan 2025

Meta genai org in panic mode

Short: Deepseek is a Chinese company that has an AI model, according to one Hacker News commenter:

The DeepSeek folks just showed the world how to do the same thing those teams do, but at ~99% lower cost -- and published all code and weights as free open-source.

Post: https://www.teamblind.com/post/Meta-genai-org-in-panic-mode-KccnF41n

It started with deepseek v3, which rendered the Llama 4 already behind in benchmarks. Adding insult to injury was the "unknown Chinese company with 5..5 million training budget"

Engineers are moving frantically to dissect deepseek and copy anything and everything we can from it. I'm not even exaggerating.

More about deepseek
https://en.wikipedia.org/wiki/DeepSeek

Github (open source)
https://github.com/deepseek-ai/DeepSeek-V3

DeepSeek claims its ‘reasoning’ model beats OpenAI’s o1 on certain benchmarks
https://techcrunch.com/2025/01/20/deepseek-claims-its-reasoning-model-beats-openais-o1-on-certain-benchmarks/

DeepSeek’s new AI model appears to be one of the best ‘open’ challengers yet
https://techcrunch.com/2024/12/26/deepseeks-new-ai-model-appears-to-be-one-of-the-best-open-challengers-yet/

The model, DeepSeek V3, was developed by the AI firm DeepSeek and was released on Wednesday under a permissive license that allows developers to download and modify it for most applications, including commercial ones.

DeepSeek V3 can handle a range of text-based workloads and tasks, like coding, translating, and writing essays and emails from a descriptive prompt.

According to DeepSeek’s internal benchmark testing, DeepSeek V3 outperforms both downloadable, “openly” available models and “closed” AI models that can only be accessed through an API. In a subset of coding competitions hosted on Codeforces, a platform for programming contests, DeepSeek outperforms other models, including Meta’s Llama 3.1 405B, OpenAI’s GPT-4o, and Alibaba’s Qwen 2.5 72B.

DeepSeek V3 also crushes the competition on Aider Polyglot, a test designed to measure, among other things, whether a model can successfully write new code that integrates into existing code.

1 replies

= new reply since forum marked as read

Highlight:

Meta genai org in panic mode (Original Post) usonian Jan 2025 OP

Sadly its Gibberish to me. SleeplessinSoCal Jan 2025 #1

SleeplessinSoCal

(10,441 posts)

1. Sadly its Gibberish to me.

Reply to usonian (Original post)

Thu Jan 23, 2025, 04:42 PM

Jan 2025

GIGERISH???

Reply to this discussion