【[227星]kubernetes-for-ml-engineers:专为机器学习工程师量身打造的Kubernetes入门指南。亮点:1. 仅需几步即可快速搭建本地Kuber
2025-05-28浏览详情
【[72星]Tiny-GRPO:从零开始实现的极简GRPO算法,让复杂优化变得轻而易举。亮点:1. 内存优化显著,训练时内存使用减少50%;2. 支持混合精
2025-04-24浏览详情
【The Mean-ing of Loss Functions:深入浅出地解析损失函数背后的数学原理和直觉。亮点:1. 从信息几何角度介绍Bregman投影;2. 探索
2025-04-04浏览详情
[CL]《Enhancing Human-Like Responses in Large Language Models》E Y Çalık, T R Akkuş (2025) 机器学习人工智能论文AI创造
2025-01-14浏览详情
[LG] Neuro-Symbolic AI in 2024: A Systematic Review 机器学习人工智能论文AI创造营
2025-01-13浏览详情
[LG]《Search-o1: Agentic Search-Enhanced Large Reasoning Models》X Li, G Dong, J Jin, Y Zhang... [Renmin University of C
[LG]《Key-value memory in the brain》S J. Gershman, I Fiete, K Irie [Harvard University & MIT] (2025) 机器学习人工智能论
[LG]《Predicting the Performance of Black-box LLMs through Self-Queries》D Sam, M Finzi, J. Z Kolter [CMU] (2025) 机器学
[LG]《Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought》V Xiang, C Snell, K Gandhi,
[LG]《Accelerated Diffusion Models via Speculative Sampling》V D Bortoli, A Galashov, A Gretton, A Doucet [Google DeepMi
【[37星]SAEBench:一个用于评估稀疏自编码器(SAE)模型性能的工具,提供了8种不同的评估方法,帮助研究人员和开发者更好地理解和优化SAE
[AS]《TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimiza
2025-01-11浏览详情
[LG]《Edge of Stochastic Stability: Revisiting the Edge of Stability for SGD》A Andreyev, P Beneventano [Princeton Unive
2025-01-08浏览详情
[LG]《Unlearning-based Neural Interpretations》(2024) 人工智能机器学习论文AI创造营
一份值得参考的2025年AI书单 1. 《AI Engineering》:一本讲Foundation Models的硬核好书。O'Reilly出品的这本书封面上那只机灵的
[AS]《Speech Recognition With LLMs Adapted to Disordered Speech Using Reinforcement Learning》C Nagpal, S Venugopalan, J
2025-01-07浏览详情
[CL]《Dynamic Skill Adaptation for Large Language Models》J Chen, D Yang [Stanford University] (2024) 机器学习人工智能论
[RO]《Data Scaling Laws in Imitation Learning for Robotic Manipulation》(2024) 人工智能机器学习论文AI创造营
张量到底是什么? 这个看似简单的问题困扰着许多人。它就像一个多面手,同时具备三大关键特性:等变性、多重线性和可分离性。然而,正是
2025-01-05浏览详情
[LG]《Learning and aligning single-neuron invariance manifolds in visual cortex》(2024) AI创造营人工智能机器学习论文
2025-01-04浏览详情
[LG] Introduction to Graph Neural Networks: A Starting Point for Machine Learning Engineers 机器学习人工智能论文AI创造
神经网络中的奥卡姆剃刀悖论,其实并不是真正的矛盾。 “简单即是美”这一科学哲学原则看似与神经网络的庞大参数空间背道而驰。但
通俗解读《Cross-Entropy Is All You Need To Invert the Data Generating Process》 AI创造营人工智能机器学习
[LG]《MAP: Multi-Human-Value Alignment Palette》(2024) 人工智能机器学习论文AI创造营
[IR]《Efficient Long Context Language Model Retrieval with Compression》M Seo, J Baek, S Lee, S J Hwang [KAIST] (2024)
2025-01-01浏览详情
[CL]《LLM2: Let Large Language Models Harness System 2 Reasoning》C Yang, C Shi, S Li, B Shui... [The Chinese University
[LG]《Latent Bayesian Optimization via Autoregressive Normalizing Flows》(2024) AI创造营人工智能机器学习论文
2024-12-31浏览详情
通俗解读:《Learning and aligning single-neuron invariance manifolds in visual cortex》 AI创造营人工智能机器学习
[LG]《Convergence of Statistical Estimators via Mutual Information Bounds》E M Khribch, P Alquier [ESSEC Business School
2024-12-30浏览详情
[CL]《State Space Models are Strong Text Rerankers》Z Xu, J Yan, A Gupta, V Srikumar [University of Utah] (2024) 机器学
2024-12-29浏览详情
[LG]《HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories》E Hedlin
[LG]《Rate of Model Collapse in Recursive Training》A T Suresh, A Thangaraj, A N K Khandavally [Google Research & Indian
2024-12-28浏览详情
[CV]《GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network》X Song, Y Zou, Z Shi, Z
[CL]《EscapeBench: Pushing Language Models to Think Outside the Box》C Qian, P Han, Q Luo, B He... [University of Illino
2024-12-27浏览详情
[CL]《Deliberation in Latent Space via Differentiable Cache Augmentation》L Liu, J Pfeiffer, J Wu, J Xie, A Szlam [Googl
[LG]《SMOSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control Tasks》M V
2024-12-26浏览详情
[CL]《Emergence of Abstractions: Concept Encoding and Decoding Mechanism for In-Context Learning in Transformers》S Han,
[LG]《HashAttention: Semantic Sparsity for Faster Inference》A Desai, S Yang, A Cuadron, A Klimovic... [UC Berkeley & E
2024-12-25浏览详情
[LG]《Explainable Procedural Mistake Detection》S Storks, I Bar-Yossef, Y Li, Z Zhang... [University of Michigan] (2024)
2024-12-24浏览详情
[LG]《Cultural Evolution of Cooperation among LLM Agents》A Vallinder, E Hughes [Google DeepMind] (2024) 机器学习人工智
[LG]《Mastering Board Games by External and Internal Planning with Language Models》J Schultz, J Adamek, M Jusup, M Lanc
2024-12-23浏览详情
[CL]《How to Synthesize Text Data without Model Collapse?》X Zhu, D Cheng, H Li, K Zhang... [Shanghai Jiao Tong Universi
[CL]《Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finet
[LG]《Analyzing (In)Abilities of SAEs via Formal Languages》A Menon, M Shrivastava, D Krueger, E S Lubana [IIIT Hyderaba
[CV]《MetaMorph: Multimodal Understanding and Generation via Instruction Tuning》S Tong, D Fan, J Zhu, Y Xiong... [Meta]
[LG]《No Free Lunch From Random Feature Ensembles》B S. Ruben, W L. Tong, H T Chaudhry, C Pehlevan [Harvard University]
2024-12-22浏览详情
[CV]《[MASK] is All You Need》V T Hu, B Ommer [LMU Munich] (2024) 机器学习人工智能论文 AI创造营
[LG]《Entropy-Regularized Process Reward Model》H Zhang, P Wang, S Diao, Y Lin… [University of Illinois Urbana-Champaig
[IR]《Semantic Retrieval at Walmart》A Magnani, F Liu, S Chaidaroon, S Yadav... [Walmart Global Technology] (2024) 机器
[CL]《Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning》Z Bi, K Han, C Liu, Y Tang... [Huawei No
2024-12-21浏览详情
【TokenLearn 静态词嵌入:一种预训练模型2Vec的方法,专注于提升自然语言处理中词嵌入的静态特性,使其更适用于各种下游任务】'Tokenl
2024-12-20浏览详情
[LG]《Discover Physical Concepts and Equations with Machine Learning》B Li, Y Gu, S Wu [Shanghai University & Ludwig-Max
[LG]《Neural general circulation models optimized to predict satellite-based precipitation observations》J Yuval, I Lang
[CL]《Understanding Knowledge Hijack Mechanism in In-context Learning through Associative Memory》S Wang, I Sato [The Un
2024-12-19浏览详情
[LG] Superhuman performance of a large language model on the reasoning tasks of a physician 机器学习人工智能论文#AI创
[CL] The Open Source Advantage in Large Language Models (LLMs) 机器学习人工智能论文#AI创造营# 本文比较分析了开源和闭
[CL] Reinforcement Learning Enhanced LLMs: A Survey 机器学习人工智能论文#AI创造营# 本文对利用强化学习增强大型语言模型
[LG]《How to Merge Your Multimodal Models Over Time?》S Dziadzio, V Udandarao, K Roth, A Prabhu... [University of T ̈ub
[RO]《RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning》C Xu, Q Li, J Luo, S Levine [UC Berkeley]
2024-12-18浏览详情
[LG]《Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data》Z Zhou, A Peng, Q Li, S Levine..
2024-12-16浏览详情
[LG]《Sinkhorn Algorithm for Sequentially Composed Optimal Transports》K Watanabe, N Isobe [National Institute of Inform
【机器学习系统设计:一个专注于机器学习系统设计的资源库,提供端到端的示例和设计文档,帮助理解和应用机器学习系统设计的核心概念】
【时间序列预测评估工具:一个轻量级库,让时间序列预测模型的基准测试变得简单。它易于扩展、可复现、易用且依赖性小】'fev - A lig
【PyTorch每步容错工具:帮助保持训练连续性,即使出现错误也不会中断整个训练任务,基于PyTorch的大型训练技术】'pytorch-labs/torchf
【Even Demo:一个演示应用程序,用于展示与智能眼镜配合的功能,包括 Even AI、图片传输和文本传输等】'Even Demo' GitHub: github.co
2024-12-15浏览详情
[LG]《Statistical Downscaling via High-Dimensional Distribution Matching with Generative Models》Z Y Wan, I Lopez-Gomez,
[CL]《Advancing Single- and Multi-task Text Classification through Large Language Model Fine-tuning》H Zhao, Q P. Chen,
2024-12-14浏览详情
[LG]《APOLLO: SGD-like Memory, AdamW-level Performance》H Zhu, Z Zhang, W Cong, X Liu... [The University of Texas at Aus
2024-12-13浏览详情
[CV]《Normalizing Flows are Capable Generative Models》S Zhai, R Zhang, P Nakkiran, D Berthelot... [Apple] (2024) 机器学
[CL]《Mixture of Hidden-Dimensions Transformer》Y Chen, J Shang, Z Zhang, J Sheng... [Chinese Academy of Sciences] (2024
【Ivy:强大的机器学习框架代码转换工具,支持PyTorch、TensorFlow、JAX、NumPy等主流框架之间的代码互转,可以轻松实现模型、工具和库
[CL]《Best-of-N Jailbreaking》J Hughes, S Price, A Lynch, R Schaeffer... [Speechmatics & MATS] (2024) 机器学习人工智能论
【大型语言模型(LLM)入门指南,涵盖了LLM的优势、局限性、应用场景和研究方向】《A Primer on Large Language Models and their Limi
[LG]《Revisiting the Initial Steps in Adaptive Gradient Descent Optimization》A Abuduweili, C Liu [CMU] (2024) 机器学习
[CL]《ProcessBench: Identifying Process Errors in Mathematical Reasoning》C Zheng, Z Zhang, B Zhang, R Lin... [Alibaba I
2024-12-12浏览详情
[LG]《Flex Attention: A Programming Model for Generating Optimized Attention Kernels》J Dong, B Feng, D Guessous, Y Lian
【supertree:一个强大的Python决策树可视化工具,支持在Jupyter等环境中交互式展示决策树,包含缩放、展开折叠节点、全屏显示等功能,兼
[CL]《ALMA: Alignment with Minimal Annotation》M Yasunaga, L Shamis, C Zhou, A Cohen… [Meta] (2024) 机器学习人工智能论
【Zephyr:一个基于JAX的声明式神经网络库,让设计、创建和操作神经网络变得更简单快捷,特别适合想要快速实现机器学习想法的开发者】'
2024-12-11浏览详情
【OLMo-core:AI2开源的OLMo语言模型核心构建模块,基于PyTorch实现,提供了完整的模型训练和优化组件,支持多种规模模型(1B-13B)训练,包
2024-12-10浏览详情
[LG]《Weak-to-Strong Generalization Through the Data-Centric Lens》C Shin, J Cooper, F Sala [University of Wisconsin-Mad
【Flash Attention:基于Triton语言实现的注意力机制算法,提供高效的计算和优化,适用于大规模数据处理】'Flash Attention implemente
[LG]《FlashAttention on a Napkin: A Diagrammatic Approach to Deep Learning IO-Awareness》V Abbott, G Zardini [University
2024-12-09浏览详情
【Deep-ML开放问题库:一个开源的问题库,专注于线性代数、机器学习和深度学习,提供从零开始解决问题的丰富学习体验,助力网站Deep-ML】
[CL]《KV Shifting Attention Enhances Language Modeling》M Xu, W Cheng, B Wang, W Chen [Baichuan Inc.] (2024) 机器学习人
【tensor-man:一个用于机器学习模型文件检查、验证、签名和验证的实用工具。支持safetensors、ONNX、GGUF和PyTorch等主流格式,具备
2024-12-08浏览详情
[LG]《The Cost of Consistency: Submodular Maximization with Constant Recourse》P Dütting, F Fusco, S Lattanzi, A Norouz
[LG]《Theoretical limitations of multi-layer Transformer》L Chen, B Peng, H Wu [UC Berkeley] (2024) 机器学习人工智能论文
2024-12-07浏览详情
[LG]《Self-Improvement in Language Models: The Sharpening Mechanism》A Huang, A Block, D J. Foster, D Rohatgi... [Micros
[CV]《Enhancing Deep Learning Model Robustness through Metamorphic Re-Training》S Togru, Y S Mostafa, K Lotfy [Technical
2024-12-06浏览详情
[LG]《Learning by Self-Explaining》W Stammer, F Friedrich, D Steinmann, M Brack... [TU Darmstadt] (2024) 机器学习人工智
2024-12-05浏览详情
【FlowModels:基于Flow-Matching的生成模型实现库,提供了多种流匹配生成模型的参考实现,包括RectFlow、LADD、Shortcut等模型,支持文
[LG]《Safety Alignment Should be Made More Than Just a Few Tokens Deep》人工智能机器学习论文
[LG]《Anytime Acceleration of Gradient Descent》Z Zhang, J D. Lee, S S. Du, Y Chen [U. Washington & Princeton] (2024) 机
2024-12-04浏览详情
[CV]《SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory》C Yang, H Huang,
【Vicinity:轻量级的最近邻搜索工具库,提供灵活的后端支持。统一了不同向量检索方案的接口,支持HNSW、FAISS、Annoy等多种向量索引后
【Decoding:一个用于增强LLM推理能力的Python库,提供可组合的推理算法框架。支持自定义评分函数的采样和重排序模式,内置蒙特卡洛树
[CL]《LLMs Do Not Think Step-by-step In Implicit Reasoning》Y Yu [Tsinghua University] (2024) 机器学习人工智能论文
[CL]《Reverse Thinking Makes LLMs Stronger Reasoners》J C Chen, Z Wang, H Palangi, R Han… [UNC Chapel Hill & Google Clo
[LG]《nGPT: Normalized Transformer with Representation Learning on the Hypersphere》I Loshchilov, C Hsieh, S Sun, B Gins
正在拼命加载中
我是有底线的
没有更多的页面可以加载啦!