awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Cross-Modal Representations · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesCross-Modal Representations

Techniques for mapping different data types into a shared latent space.

Distinguishing note: Focuses on bridging linguistic and sonic domains via shared latent spaces.

Explore 1 awesome GitHub repository matching artificial intelligence & ml · Cross-Modal Representations. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Cross-Modal Representations

Awesome Cross-Modal Representations GitHub Repositories

Describe the repository you're looking for…
Find the best repos with AI.We'll search the best matching repositories with AI.
  • suno-ai/bark

    suno-ai/bark

    38,980View on GitHub↗

    Bark is a generative audio engine and machine learning inference library designed to convert written text into high-fidelity speech and sound effects. It functions as a text-to-audio transformer, utilizing multi-stage neural network architectures to map semantic input tokens into detailed audio codebooks for synthesis. The system distinguishes itself through a hierarchical transformer stacking approach that separates semantic understanding from acoustic realization. By employing autoregressive token prediction and vector quantized codebook mapping, the engine bridges linguistic and sonic doma

    Projects text and audio data into a shared mathematical space to bridge linguistic and sonic domains.

    Jupyter Notebook
    38,980View on GitHub↗