☆ Yσɠƚԋσʂ ☆@lemmy.ml

☆ Yσɠƚԋσʂ ☆@lemmy.ml

Machine Learning

machinelearning@lemmy.ml

PostsComments

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 5 days ago

Learning to Reason in 13 Parameters

arxiv.org

Learning to Reason in 13 Parameters

arxiv.org

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 5 days ago

☆ Yσɠƚԋσʂ ☆@lemmy.ml

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 5 months ago

Andrej Karpathy — “We’re summoning ghosts, not building animals”

www.youtube.com

Andrej Karpathy — “We’re summoning ghosts, not building animals”

www.youtube.com

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 5 months ago

TheracAriane@thebrainbin.org

TheracAriane@thebrainbin.org · 5 months ago

A dialogue on Machine Learning 🤓🤓🤓

codeberg.org

A dialogue on Machine Learning 🤓🤓🤓

codeberg.org

TheracAriane@thebrainbin.org · 5 months ago

☆ Yσɠƚԋσʂ ☆@lemmy.ml

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 7 months ago

Breathing Life Into Sketches Using Text-to-Video Priors

livesketch.github.io

Breathing Life Into Sketches Using Text-to-Video Priors

livesketch.github.io

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 7 months ago

☆ Yσɠƚԋσʂ ☆@lemmy.ml

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 8 months ago

Jan v1: 4B open model for web search with 91% SimpleQA, slightly outperforms Perplexity Pro

arxiv.org

Jan v1: 4B open model for web search with 91% SimpleQA, slightly outperforms Perplexity Pro

arxiv.org

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 8 months ago

Phil Nelson@lemmy.ml

Phil Nelson@lemmy.mlEnglish · 9 months ago

OpenCV 4.12.0 Is Now Available

opencv.org

OpenCV 4.12.0 Is Now Available

opencv.org

Phil Nelson@lemmy.mlEnglish · 9 months ago

blue_berry@lemmy.world

blue_berry@lemmy.worldEnglish · 9 months ago

Anthem Demo - Napster plus Distributed Machine Learning

makertube.net

-1

Anthem Demo - Napster plus Distributed Machine Learning

makertube.net

blue_berry@lemmy.worldEnglish · 9 months ago

A🔻atar of 🔻engeance@lemmy.ml

A🔻atar of 🔻engeance@lemmy.mlEnglish · 9 months ago

Affiliations of the ICML 2025 papers

lemmy.ml

Affiliations of the ICML 2025 papers

lemmy.ml

A🔻atar of 🔻engeance@lemmy.mlEnglish · 9 months ago

☆ Yσɠƚԋσʂ ☆@lemmy.ml

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 10 months ago

The Bitter Lesson is coming for Tokenization

lucalp.dev

The Bitter Lesson is coming for Tokenization

lucalp.dev

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 10 months ago

☆ Yσɠƚԋσʂ ☆@lemmy.ml

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 1 year ago

The Attention Mechanism Born for Cost Optimization

oilbeater.com

The Attention Mechanism Born for Cost Optimization

oilbeater.com

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 1 year ago

thickertoofan@lemm.ee

thickertoofan@lemm.eeEnglish · 1 year ago

dcdaML - devanagari character detection dataset training framework

github.com

dcdaML - devanagari character detection dataset training framework

github.com

thickertoofan@lemm.eeEnglish · 1 year ago

☆ Yσɠƚԋσʂ ☆@lemmy.ml

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 1 year ago

Neural Graffiti is an experiment in adding a "Spray Layer" to a transformer model, which injects a memory trace into the final stages of inference without finetuning or retraining

github.com

Neural Graffiti is an experiment in adding a "Spray Layer" to a transformer model, which injects a memory trace into the final stages of inference without finetuning or retraining

github.com

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 1 year ago

fubarx@lemmy.world

fubarx@lemmy.world · 1 year ago

Breaking GPT-5 News!

-4

Breaking GPT-5 News!

fubarx@lemmy.world · 1 year ago

4Robato@lemmy.world

4Robato@lemmy.worldEnglish · 1 year ago

I want to open source a dataset but I'm not sure what license to use

4Robato@lemmy.worldEnglish · 1 year ago

☆ Yσɠƚԋσʂ ☆@lemmy.ml

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 1 year ago

Why do LLMs make stuff up? New research peers under the hood.

arstechnica.com

Why do LLMs make stuff up? New research peers under the hood.

arstechnica.com

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 1 year ago

oba@lemmy.world

oba@lemmy.worldEnglish · 1 year ago

MLOps tips I gathered recently

www.readyforagents.com

MLOps tips I gathered recently

www.readyforagents.com

oba@lemmy.worldEnglish · 1 year ago

☆ Yσɠƚԋσʂ ☆@lemmy.ml

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 1 year ago

DeepSeek open source DeepEP – library for MoE training and Inference

github.com

DeepSeek open source DeepEP – library for MoE training and Inference

github.com

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 1 year ago

☆ Yσɠƚԋσʂ ☆@lemmy.ml

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 1 year ago

Towards Monosemanticity: Decomposing Language Models With Dictionary Learning

transformer-circuits.pub

Towards Monosemanticity: Decomposing Language Models With Dictionary Learning

transformer-circuits.pub

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 1 year ago

☆ Yσɠƚԋσʂ ☆@lemmy.ml

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 1 year ago

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

transformer-circuits.pub

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

transformer-circuits.pub

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 1 year ago

☆ Yσɠƚԋσʂ ☆@lemmy.ml

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 1 year ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

arxiv.org

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

arxiv.org

☆ Yσɠƚԋσʂ ☆@lemmy.mlEnglish · 1 year ago