AI image generation

Six Ways to Control Style and Content in Diffusion Models

Stable Diffusion 1.5/2.0/2.1/XL 1.0, DALL-E, Imagen… Up to now years, Diffusion Models have showcased stunning quality in image generation. Nonetheless, while producing great quality on generic concepts, these struggle to generate top quality for...

The Way forward for RAG-Augmented Image Generation

Generative diffusion models like Stable Diffusion, Flux, and video models corresponding to Hunyuan depend on knowledge acquired during a single, resource-intensive training session using a hard and fast dataset. Any concepts introduced after this...

Stable Diffusion 3.5: Innovations That Redefine AI Image Generation

AI has transformed many industries, but its impact on image generation is remarkable. Tasks that when required the expertise of skilled artists or complex graphic design tools can now be achieved effortlessly with just...

Improving Green Screen Generation for Stable Diffusion

Despite community and investor enthusiasm around visual generative AI, the output from such systems will not be at all times ready for real-world usage; one example is that gen AI systems are likely to...

Disney Research Offers Improved AI-Based Image Compression – But It May Hallucinate Details

Disney's Research arm is offering a brand new approach to compressing images, leveraging the open source Stable Diffusion V1.2 model to supply more realistic images at lower bitrates than competing methods. Source: https://studios.disneyresearch.com/app/uploads/2024/09/Lossy-Image-Compression-with-Foundation-Diffusion-Models-Paper.pdfThe brand...

Generating Higher AI Video From Just Two Images

Video frame interpolation (VFI) is an open problem in generative video research. The challenge is to generate intermediate frames between two existing frames in a video sequence. Sources: https://film-net.github.io/ and https://arxiv.org/pdf/2202.04901Broadly speaking, this...

Leveraging Human Attention Can Improve AI-Generated Images

Recent research from China has proposed a technique for improving the standard of images generated by Latent Diffusion Models (LDMs) models similar to Stable Diffusion.The tactic focuses on optimizing the of a picture...

A Poisoning Attack Against 3D Gaussian Splatting

A brand new research collaboration between Singapore and China has proposed a way for attacking the favored synthesis method 3D Gaussian Splatting (3DGS). Source: https://arxiv.org/pdf/2410.08190The attack uses crafted training images of such complexity that...

Recent posts

Popular categories

ASK ANA