几篇论文实现代码：《Salient Object-Aware B-20241225135631（微信文章未删减版）

几篇论文实现代码：
《Salient Object-Aware Background Generation using Text-Guided Diffusion Models》(CVPR 2024) GitHub: github.com/yahoo/photo-background-generation
《Dyadic Interaction Modeling for Social Behavior Generation》(ECCV 2024) GitHub: github.com/Boese0601/Dyadic-Interaction-Modeling
《Large Motion Video Autoencoding with Cross-modal Video VAE》(2024) GitHub: github.com/VideoVerses/VideoVAEPlus
《Automating the Search for Artificial Life with Foundation Models》(2024) GitHub: github.com/SakanaAI/asal [fig1]
《DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation》(2024) GitHub: github.com/TencentARC/DiTCtrl
《PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World》(2024) GitHub: github.com/GAIR-NLP/PC-Agent [fig2]
《Hyper-Connections》(2024) GitHub: github.com/lucidrains/hyper-connections [fig3]
《Prompting Latent Diffusion for Fast Unpaired Image-to-Image Translation》(2024) GitHub: github.com/LatentSB/LatentSB
《Building Math Agents with Multi-Turn Iterative Preference Learning》(2024) GitHub: github.com/WeiXiongUST/Building-Math-Agents-with-Multi-Turn-Iterative-Preference-Learning
《B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners》(2024) GitHub: github.com/hkust-nlp/B-STaR
《OpenRFT: Adapting Reasoning Foundation Model for Domain-Specific Tasks with Reinforcement Fine-Tuning》(2024) GitHub: github.com/ADaM-BJTU/OpenRFT [fig4]
《Exploring Enhanced Contextual Information for Video-Level Object Tracking》(2024) GitHub: github.com/kangben258/MCITrack [fig5]
《MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt》(2024) GitHub: github.com/924973292/MambaPro [fig6]
《InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models》(2024) GitHub: github.com/congvvc/InstructSeg [fig7]
《SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator》(2024) GitHub: github.com/HKUDS/SepLLM [fig8]
《Learning Source Disentanglement in Neural Audio Codec》(2024) GitHub: github.com/XiaoyuBIE1994/SDCodec [fig9]
《MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models》(2024) GitHub: github.com/MTU-Bench-Team/MTU-Bench [fig10]
《EVA-Gaussian: 3D Gaussian-Based Real-time Human Novel View Synthesis Under Diverse Camera Settings》(2024) GitHub: github.com/zhenliuZJU/EVA-Gaussian
《Moving Object Segmentation in Point Cloud Data using Hidden Markov Models》(2024) GitHub: github.com/vb44/HMM-MOS
《Offline Reinforcement Learning for LLM Multi-Step Reasoning》(2024) GitHub: github.com/jwhj/OREO
《Learning Pattern-Specific Experts for Time Series Forecasting Under Patch-level Distribution Shift》(2024) GitHub: github.com/syrGitHub/TFPS [fig11]
《Dyadic Interaction Modeling for Social Behavior Generation》(ECCV 2024) GitHub: github.com/Boese0601/Dyadic-Interaction-Modeling
《DEIO: Deep Event Inertial Odometry》(2024) GitHub: github.com/arclab-hku/DEIO [fig12]
《A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement》(2024) GitHub: github.com/nju-websoft/PairCoder
《Light Unbalanced Optimal Transport》(2024) GitHub: github.com/milenagazdieva/LightUnbalancedOptimalTransport
《PruneVid: Visual Token Pruning for Efficient Video Large Language Models》(2024) GitHub: github.com/Visual-AI/PruneVid

几篇论文实现代码：《Salient Object-Aware B-20241225135631

正文

2024-12-25 13:56
本条微博链接