Deep Chic, who shocked with the reasoning model ‘R1’, has launched an open source image model. Following the language model, it was intended to proceed the momentum within the image field, and said that it surpassed the ‘Dali 3’ of Open AI and ‘Stable Defusion’ of the Stage AI.
Deep Chic will understand and create a picture through Hugging Face on the twenty seventh (local time) ‘Janus Pro 7B ‘Launched.
In keeping with technical papers, this model is superb in various visual tasks akin to realistic image creation, complex visual reasoning, and image caption creation as a bonus of efficiency and variety. “The goal is to balance the performance and the fee of calculation, and achieved cutting -edge performance in a big selection of vision work.”
Following the V3, which was released last month and the R1, which was released last week, it’s the third major model released a month. This time, it emphasized efficiency. It is feasible to supply high levels of performance without requiring an unlimited calculation resource.
This model is an upgraded ‘Janus’ model. The researchers said that they separated the encoders who’re accountable for understanding images and the encoders that handle the image production to beat the restrictions of the previous model, and optimized the performance and improved the output quality. As a substitute, the integrated transformer architecture was used for processing.
As well as, it has been effective micro -adjustment through the model learning process and data adjustment, and specifically, it has improved stability and output accuracy, including 72 million synthetic data samples and 90 million multimodal datasets.
As well as, he stressed that it has expanded its existing 1 billion parameters to expand its ability to input complex inputs and various tasks.
The input image evaluation is proscribed to 384×384 resolution. Nonetheless, the benchmark surpassed the performance of other models.
Specifically, within the ‘Geneval’, which analyzes objects within the image, and ‘DGP-Bench’, which tests the image creation function for complex and demanding prompts, He said.

The researchers said, “Janus-Pro surpasses the previous integrated model and is identical or superior to the performance of the prevailing model by work.” .
Deep chic can experience this model Demo siteIt was also released.
By Park Chan, reporter cpark@aitimes.com