Exploring the Latest Advancements in AI Research

Our community of open source research hubs has over 200,000 members building the future of AI. We are working globally with our partners, industry leaders, and experts to develop cutting-edge open AI models for Image, Language, Audio, Video, 3D, Biology and more.

View Open Roles
Use Stability HPC
MARBLE: Material Recomposition and Blending in CLIP-Space
Guest User Guest User

MARBLE: Material Recomposition and Blending in CLIP-Space

Editing materials of objects in images based on exemplar images is an active area of research in computer vision and graphics. We propose MARBLE, a method for performing material blending and recomposing fine-grained material properties by finding material embeddings in CLIP-space and using that to control pre-trained text-to-image models.

Read More
FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image
Guest User Guest User

FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image

We present a novel framework for generating high-quality, animatable 4D avatar from a single image. While recent advances have shown promising results in 4D avatar creation, existing methods either require extensive multiview data or struggle with shape accuracy and identity consistency.

Read More
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation
Guest User Guest User

Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation

Diffusion models are the main driver of progress in image and video synthesis, but suffer from slow inference speed. Distillation methods, like the recently introduced adversarial diffusion distillation (ADD) aim to shift the model from many-shot to single-step inference, albeit at the cost of expensive and difficult optimization due to its reliance on a fixed pretrained DINOv2 discriminator.

Read More
Stable Audio Open
Guest User Guest User

Stable Audio Open

Here we describe the architecture and training process of a new open-weights text-to-audio model trained with Creative Commons data. Our evaluation shows that the model's performance is competitive with the state-of-the-art across various metrics.

Read More