Broadcast News
26/02/2025
Alibaba Cloud Makes Its AI Models For Video Generation Freely Available

Alibaba Cloud has made its AI models for video generation freely available as part of its latest efforts to contribute to the open-source community.
The cloud computing company is open sourcing four models of its 14-billion(B)-parameter and 1.3-billion(B)-parameter versions of Wan2.1 series, the latest iteration of its video foundation model Tongyi Wanxiang (Wan).
The four models, including T2V-14B, T2V-1.3B, I2V-14B-720P and I2V-14B-480P, are designed to generate high-quality images and videos from text and image inputs. They are available for download on Alibaba Cloud’s AI model community, ModelScope, and the collaborative AI platform HuggingFace, accessible to academics, researchers and commercial institutions worldwide.
Unveiled earlier this year, Wan2.1 series is the first video generation model to support text effects in both Chinese and English. It excels at generating realistic visuals by accurately handling complex movements, enhancing pixel quality, adhering to physical principles, and optimizing the precision of instruction execution. Its precision in following instructions has propelled Wan2.1 to the top of the VBench leaderboard, a comprehensive benchmark suite for video generative models. According to VBench, the Wan2.1 series, with an overall score of 86.22%, leads in key dimensions such as dynamic degree, spatial relationships, colour and multi-object interactions.
Training video foundation models requires immense computing resources and vast amounts of high-quality training data. Open access helps lower the barrier for more businesses to leverage AI, enabling them to create high-quality visual content tailored to their needs in a cost-effective way.
The T2V-14B model is better suited for creating high-quality visuals with substantial motion dynamics, while the T2V-1.3B model strikes a balance between generation quality and computational power, making it ideal for a broad range of developers conducting secondary development and academic research. For example, the T2V-1.3B model allows users with standard personal laptops to generate a 5-second-long video at 480p resolution in as little as around 4 minutes.
In addition to supporting text-to-video generation, the I2V-14B-720P and I2V-14B-480P models also offer image-to-video capabilities. Users simply need to input a single image along with a brief text description to generate dynamic video content. The platform supports normal-sized image inputs of any dimensions.
Alibaba Cloud was one of the first major global tech companies to make its self-developed large-scale AI model open-source, releasing its first open-model Qwen (Qwen-7B) in August 2023. Qwen open-models have consistently topped the HuggingFace Open LLM Leaderboards, with performances matching that of leading global AI models across various benchmarks.
www.alibabacloud.com
The cloud computing company is open sourcing four models of its 14-billion(B)-parameter and 1.3-billion(B)-parameter versions of Wan2.1 series, the latest iteration of its video foundation model Tongyi Wanxiang (Wan).
The four models, including T2V-14B, T2V-1.3B, I2V-14B-720P and I2V-14B-480P, are designed to generate high-quality images and videos from text and image inputs. They are available for download on Alibaba Cloud’s AI model community, ModelScope, and the collaborative AI platform HuggingFace, accessible to academics, researchers and commercial institutions worldwide.
Unveiled earlier this year, Wan2.1 series is the first video generation model to support text effects in both Chinese and English. It excels at generating realistic visuals by accurately handling complex movements, enhancing pixel quality, adhering to physical principles, and optimizing the precision of instruction execution. Its precision in following instructions has propelled Wan2.1 to the top of the VBench leaderboard, a comprehensive benchmark suite for video generative models. According to VBench, the Wan2.1 series, with an overall score of 86.22%, leads in key dimensions such as dynamic degree, spatial relationships, colour and multi-object interactions.
Training video foundation models requires immense computing resources and vast amounts of high-quality training data. Open access helps lower the barrier for more businesses to leverage AI, enabling them to create high-quality visual content tailored to their needs in a cost-effective way.
The T2V-14B model is better suited for creating high-quality visuals with substantial motion dynamics, while the T2V-1.3B model strikes a balance between generation quality and computational power, making it ideal for a broad range of developers conducting secondary development and academic research. For example, the T2V-1.3B model allows users with standard personal laptops to generate a 5-second-long video at 480p resolution in as little as around 4 minutes.
In addition to supporting text-to-video generation, the I2V-14B-720P and I2V-14B-480P models also offer image-to-video capabilities. Users simply need to input a single image along with a brief text description to generate dynamic video content. The platform supports normal-sized image inputs of any dimensions.
Alibaba Cloud was one of the first major global tech companies to make its self-developed large-scale AI model open-source, releasing its first open-model Qwen (Qwen-7B) in August 2023. Qwen open-models have consistently topped the HuggingFace Open LLM Leaderboards, with performances matching that of leading global AI models across various benchmarks.
www.alibabacloud.com
Top Related Stories
Click here for the latest broadcast news stories.
21/01/2025
Alibaba Cloud Unveils Latest AI Models, ToolsAand Infrastructure Available
Alibaba Cloud has unveiled an expanded suite of large language models and AI development tools, upgraded infrastructure offerings, and new support pro
Alibaba Cloud Unveils Latest AI Models, ToolsAand Infrastructure Available
Alibaba Cloud has unveiled an expanded suite of large language models and AI development tools, upgraded infrastructure offerings, and new support pro
10/02/2025
G42 & AMD To Enable AI Innovation
G42, the leading AI technology group from Abu Dhabi, United Arab Emirates (UAE), announced today a strategic investment in France in partnership with
G42 & AMD To Enable AI Innovation
G42, the leading AI technology group from Abu Dhabi, United Arab Emirates (UAE), announced today a strategic investment in France in partnership with
17/12/2024
Alibaba Cloud Named Leader in The Forrester Wave™: Public Cloud Platforms Q4 2024 Report
Alibaba Cloud has recently been named a Leader in The Forrester Wave™: Public Cloud Platforms Q4 2024 report. Alibaba Cloud believes this designation
Alibaba Cloud Named Leader in The Forrester Wave™: Public Cloud Platforms Q4 2024 Report
Alibaba Cloud has recently been named a Leader in The Forrester Wave™: Public Cloud Platforms Q4 2024 report. Alibaba Cloud believes this designation
12/08/2024
Alibaba Cloud Harnesses The Power Of Its Cloud Technologies
Alibaba Cloud has harnessed the power of its cloud technologies to drive multiple initiatives throughout the Olympic Games Paris 2024 (Paris 2024). Th
Alibaba Cloud Harnesses The Power Of Its Cloud Technologies
Alibaba Cloud has harnessed the power of its cloud technologies to drive multiple initiatives throughout the Olympic Games Paris 2024 (Paris 2024). Th
24/10/2024
StreamGuys To Shine Light On Business Models For Audio And Video Streams
StreamGuys will shine a light on business models for audio and video streams in a new webinar series beginning next week, emphasising how broadcasters
StreamGuys To Shine Light On Business Models For Audio And Video Streams
StreamGuys will shine a light on business models for audio and video streams in a new webinar series beginning next week, emphasising how broadcasters
15/03/2004
Ross Video to launch 10 new multi-definition switcher models at NAB 2004
Ross Video has announced the launch of 10 new multi-definition switcher models to be introduced at NAB 2004. Models range from the compact and powerfu
Ross Video to launch 10 new multi-definition switcher models at NAB 2004
Ross Video has announced the launch of 10 new multi-definition switcher models to be introduced at NAB 2004. Models range from the compact and powerfu
30/10/2024
Brightcove AI Suite To Be Expanded
Brightcove has announced it will expand its recently launched Brightcove AI Suite with a new AI-Text-to-Video pilot in Q1 2025 and will soon add addit
Brightcove AI Suite To Be Expanded
Brightcove has announced it will expand its recently launched Brightcove AI Suite with a new AI-Text-to-Video pilot in Q1 2025 and will soon add addit
29/05/2024
Alibaba Cloud Tests Its AI-Enhanced Multi-Camera Replay Service
Alibaba Cloud in partnership with Olympic Broadcasting Services (OBS) has tested its latest AI-enhanced multi-camera replay service at the Olympic Qua
Alibaba Cloud Tests Its AI-Enhanced Multi-Camera Replay Service
Alibaba Cloud in partnership with Olympic Broadcasting Services (OBS) has tested its latest AI-enhanced multi-camera replay service at the Olympic Qua
29/02/2024
Newsbridge Unveils Next-Generation MXT-1.5 AI
At the 2024 NAB Show, Newsbridge will showcase groundbreaking new features powered by the latest version of its award-winning AI indexing models. Atte
Newsbridge Unveils Next-Generation MXT-1.5 AI
At the 2024 NAB Show, Newsbridge will showcase groundbreaking new features powered by the latest version of its award-winning AI indexing models. Atte
23/05/2024
Advanced Systems Group Launches AI and Advanced Analytics Practice
Advanced Systems Group, LLC (ASG) has launched its 'AI and Advanced Analytics' practice, allowing for delivery of complete AI solutions. The AI team w
Advanced Systems Group Launches AI and Advanced Analytics Practice
Advanced Systems Group, LLC (ASG) has launched its 'AI and Advanced Analytics' practice, allowing for delivery of complete AI solutions. The AI team w
11/01/2023
iSIZE BitClear Achieves AI-Driven Live Video Denoising And Upscaling
iSIZE has released first performance results taking full advantage of the special instructions enabled by Intel Advanced Matrix Extensions (Intel AMX)
iSIZE BitClear Achieves AI-Driven Live Video Denoising And Upscaling
iSIZE has released first performance results taking full advantage of the special instructions enabled by Intel Advanced Matrix Extensions (Intel AMX)
21/04/2009
Ross Video Demonstrates A.I. Memory Recall At NAB
Ross Video will demonstrate a revolutionary new memory recall feature that will enhance the way their production switchers function at NAB 2009. A.I.
Ross Video Demonstrates A.I. Memory Recall At NAB
Ross Video will demonstrate a revolutionary new memory recall feature that will enhance the way their production switchers function at NAB 2009. A.I.
16/08/2023
Magewell Announces New Models To Its Eco Capture Family Of M.2 Cards
Magewell announced two new models in the company's Eco Capture family of ultra-compact, power-efficient M.2 video capture cards. Hot on the heels of t
Magewell Announces New Models To Its Eco Capture Family Of M.2 Cards
Magewell announced two new models in the company's Eco Capture family of ultra-compact, power-efficient M.2 video capture cards. Hot on the heels of t
25/05/2022
Marshall Electronics Updates Its POV And PTZ Camera Models
Marshall Electronics is continuing to update and add features to its POV and PTZ camera models with accessory add-ons and easy field-upgradable firmwa
Marshall Electronics Updates Its POV And PTZ Camera Models
Marshall Electronics is continuing to update and add features to its POV and PTZ camera models with accessory add-ons and easy field-upgradable firmwa
08/10/2021
VO To Provide Information About OTT Business Models
Viaccess-Orca (VO) has announced it will provide a wealth of information about OTT business models and technology during a panel discussion at the 202
VO To Provide Information About OTT Business Models
Viaccess-Orca (VO) has announced it will provide a wealth of information about OTT business models and technology during a panel discussion at the 202