几篇论文实现代码:
《OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking》(NeurIPS 2024) GitHub: github.com/Coo1Sea/OVT-B-Dataset
《ScanTalk: 3D Talking Heads from Unregistered Scans》(ECCV 2024) GitHub: github.com/miccunifi/ScanTalk [fig7]
《Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models》(2025) GitHub: github.com/hustvl/LightningDiT
《The Pitfalls of Memorization: When Memorization Hurts Generalization》(2024) GitHub: github.com/facebookresearch/Pitfalls-of-Memorization
《FullStack Bench: Evaluating LLMs as Full Stack Coders》(2024) GitHub: github.com/bytedance/FullStackBench [fig2]
《Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering》(2024) GitHub: github.com/Fchaubard/gradient_agreement_filtering
《Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding》(2024) GitHub: github.com/masa-ue/SVDD [fig3]
《Identity-Preserving Face Swapping via Dual Surrogate Generative Models》(2024) GitHub: github.com/ICTMCG/CSCS
《OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis》(2024) GitHub: github.com/OS-Copilot/OS-Genesis
《Practical Compact Deep Compressed Sensing》(2024) GitHub: github.com/Guaishou74851/PCNet [fig4]
《Rethinking Efficient 3D Equivariant Graph Neural Networks》(2024) GitHub: github.com/lucidrains/gotennet-pytorch [fig5]
《Long-context Protein Language Model》(2024) GitHub: github.com/amazon-science/LC-PLM
《LatentCRF: Continuous CRF for Efficient Latent Diffusion》(2024) GitHub: github.com/LatentCRF/LatentCRF
《Micro-Structures Graph-Based Point Cloud Registration for Balancing Efficiency and Accuracy》(2024) GitHub: github.com/Rolin-zrl/MicroG [fig6]
《LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation》(2024) GitHub: github.com/interview-eval/interview-eval
《Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion》(2024) GitHub: github.com/KaKituken/affordance-aware-any
《Hammer: Robust Function-Calling for On-Device Language Models via Function Masking》(2024) GitHub: github.com/MadeAgents/Hammer
《GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding》(2024) GitHub: github.com/hustvl/GaussTR
《A Hybrid Transformer-Mamba Network for Single Image Deraining》(2024) GitHub: github.com/sunshangquan/TransMamba [fig8]
《DynaMoN: Motion-Aware Fast and Robust Camera Localization for Dynamic Neural Radiance Fields》(2024) GitHub: github.com/HannahHaensen/DynaMoN [fig9]
#人工智能##AI创造营#
《OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking》(NeurIPS 2024) GitHub: github.com/Coo1Sea/OVT-B-Dataset
《ScanTalk: 3D Talking Heads from Unregistered Scans》(ECCV 2024) GitHub: github.com/miccunifi/ScanTalk [fig7]
《Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models》(2025) GitHub: github.com/hustvl/LightningDiT
《The Pitfalls of Memorization: When Memorization Hurts Generalization》(2024) GitHub: github.com/facebookresearch/Pitfalls-of-Memorization
《FullStack Bench: Evaluating LLMs as Full Stack Coders》(2024) GitHub: github.com/bytedance/FullStackBench [fig2]
《Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering》(2024) GitHub: github.com/Fchaubard/gradient_agreement_filtering
《Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding》(2024) GitHub: github.com/masa-ue/SVDD [fig3]
《Identity-Preserving Face Swapping via Dual Surrogate Generative Models》(2024) GitHub: github.com/ICTMCG/CSCS
《OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis》(2024) GitHub: github.com/OS-Copilot/OS-Genesis
《Practical Compact Deep Compressed Sensing》(2024) GitHub: github.com/Guaishou74851/PCNet [fig4]
《Rethinking Efficient 3D Equivariant Graph Neural Networks》(2024) GitHub: github.com/lucidrains/gotennet-pytorch [fig5]
《Long-context Protein Language Model》(2024) GitHub: github.com/amazon-science/LC-PLM
《LatentCRF: Continuous CRF for Efficient Latent Diffusion》(2024) GitHub: github.com/LatentCRF/LatentCRF
《Micro-Structures Graph-Based Point Cloud Registration for Balancing Efficiency and Accuracy》(2024) GitHub: github.com/Rolin-zrl/MicroG [fig6]
《LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation》(2024) GitHub: github.com/interview-eval/interview-eval
《Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion》(2024) GitHub: github.com/KaKituken/affordance-aware-any
《Hammer: Robust Function-Calling for On-Device Language Models via Function Masking》(2024) GitHub: github.com/MadeAgents/Hammer
《GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding》(2024) GitHub: github.com/hustvl/GaussTR
《A Hybrid Transformer-Mamba Network for Single Image Deraining》(2024) GitHub: github.com/sunshangquan/TransMamba [fig8]
《DynaMoN: Motion-Aware Fast and Robust Camera Localization for Dynamic Neural Radiance Fields》(2024) GitHub: github.com/HannahHaensen/DynaMoN [fig9]
#人工智能##AI创造营#