  1. Classification Done Right for Vision-Language Pre-Training

    2024-11-05

  2. Video Instruction Tuning With Synthetic Data

    Oct 3, 2024 · The development of video large multimodal models (LMMs) has been hindered by the difficulty of curating large amounts of high-quality raw data from the web. To address this, we …

  3. Seed-Thinking-v1.5: Advancing Superb Reasoning Models with ...

    Apr 10, 2025 · Seed-Thinking-v1.5 achieves 86.7 on AIME 2024, 55.0 on Codeforces and 77.3 on GPQA, demonstrating excellent reasoning abilities in STEM and coding. Beyond reasoning tasks, …

  4. MaskBit: Embedding-free Image Generation via Bit Tokens

    Dec 8, 2024 · ABSTRACT Masked transformer models for class-conditional image generation have become a compelling alternative to diffusion models. Typically comprising two stages - an initial …

  5. KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks

    2024-10-09

  6. DSTC: Direct Preference Learning with Only Self-Generated Tests and Code to Improve Code LMs

    2024-11-20

  7. How Far is Video Generation from World Model: A Physical Law Perspective

    2024-11-04

  8. DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention

    2024-11-26

  9. Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

    2024-10-10

  10. Hyper-Connections - Publications - ByteDance Seed Team

    Mar 18, 2025 · We present hyper-connections, a simple yet effective method that can serve as an alternative to residual connections. This approach specifically addresses common drawbacks …