Lemmit.Online bot@lemmit.onlineMBEnglish · 17 hours ago[R] Gradient accumulation bug fix in nightly transformersplus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[R] Gradient accumulation bug fix in nightly transformersplus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 17 hours agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 17 hours ago[D] Potential Plagiarism in ICLR 2024 Spotlight: Shengjie Luo and Tianlang Chen's "Gaunt Tensor Products"plus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[D] Potential Plagiarism in ICLR 2024 Spotlight: Shengjie Luo and Tianlang Chen's "Gaunt Tensor Products"plus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 17 hours agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 17 hours ago[R] RWKV-7: attention-free and surpassing strong Modded-GPT baseline (the one with Muon optimizer), while only using headsz 64plus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[R] RWKV-7: attention-free and surpassing strong Modded-GPT baseline (the one with Muon optimizer), while only using headsz 64plus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 17 hours agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 1 day ago[Discussion] Now that i have an engineering job, how do i keep updated on latest interesting papers ?plus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[Discussion] Now that i have an engineering job, how do i keep updated on latest interesting papers ?plus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 1 day agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 2 days ago[R] Google Shopping 10M dataset for large scale multimodal product retrieval and rankingplus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[R] Google Shopping 10M dataset for large scale multimodal product retrieval and rankingplus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 2 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 2 days ago[D] Future of Multi-Armed Bandits?plus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[D] Future of Multi-Armed Bandits?plus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 2 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 2 days ago[D] How to discredit your whole paper in one figureplus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[D] How to discredit your whole paper in one figureplus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 2 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 2 days ago[P] NHiTs: Deep Learning + Signal Processing for Time-Series Forecastingplus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[P] NHiTs: Deep Learning + Signal Processing for Time-Series Forecastingplus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 2 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 3 days ago[D] Why do PhD Students in the US seem like overpowered final bossesplus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[D] Why do PhD Students in the US seem like overpowered final bossesplus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 3 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 3 days ago[D] An interesting thread from December 2021 discussing the efficacy of transformersplus-squarewww.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[D] An interesting thread from December 2021 discussing the efficacy of transformersplus-squarewww.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 3 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 3 days ago[Project] Tsetlin Machine for Deep Logical Learning and Reasoning With Graphs (finally, after six years!)plus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[Project] Tsetlin Machine for Deep Logical Learning and Reasoning With Graphs (finally, after six years!)plus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 3 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 4 days ago[R] LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBenchplus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[R] LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBenchplus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 4 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 4 days ago[R] Limitations in Mainstream LLM Tokenizersplus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[R] Limitations in Mainstream LLM Tokenizersplus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 4 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 4 days ago[P] How to extract insights from 500k chat messages using LLMs?plus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[P] How to extract insights from 500k chat messages using LLMs?plus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 4 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 5 days ago[D] PyTorch 2.5.0 released!plus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[D] PyTorch 2.5.0 released!plus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 5 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 5 days ago[P] How to build a custom text classifier without days of human labelingplus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[P] How to build a custom text classifier without days of human labelingplus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 5 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 5 days ago[P] A technical guide on how to upgrade your training code from single GPU to multiple GPUsplus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[P] A technical guide on how to upgrade your training code from single GPU to multiple GPUsplus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 5 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 5 days ago[D] Seeking advice from industry researchers who previously held roles in academia or completed a PhDplus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[D] Seeking advice from industry researchers who previously held roles in academia or completed a PhDplus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 5 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 6 days ago[D] Am I hallucinating?plus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[D] Am I hallucinating?plus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 6 days agomessage-square0fedilink
Lemmit.Online bot@lemmit.onlineMBEnglish · 6 days ago[D] What qualifies as a sensitive attribute in equity and fairness research?plus-squareold.reddit.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-link[D] What qualifies as a sensitive attribute in equity and fairness research?plus-squareold.reddit.comLemmit.Online bot@lemmit.onlineMBEnglish · 6 days agomessage-square0fedilink