On August 25,????? ?????? ?????? Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Alibaba Cloud’s 7 billion parameter foundational language model Qwen-7B. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios including image and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]
Joss Whedon's 'The Nevers' is a weak remix of his formerly good ideasApril the giraffe, known for her viral livestreamed pregnancy, dies at 20How to shut down Spot the robot 'dog,' should you ever need toOutdated pregnancy terms need a rebrandChevy takes on Ford's electric FInstagram will let some users choose whether they want to see like counts13 best tweets of the week, including badass nuns, Werner Herzog, and Coach Horny'Worn Stories' on Netflix has dazzling animation unlike anything you've seen on TVOwning an e'Primal' on HBO Max makes you feel deeply without a word: Review Lynk & Co’s flagship SUV to compete with Range Rover, Li Auto’s L9 · TechNode NetEase to launch mobile version of new martial arts game Where Winds Meet next week · TechNode US weighs potential regulations on Chinese drones · TechNode Chinese startup Sharge unveils first mass Former Microsoft and Alibaba veteran Hu Yunhua joins Zhipu AI as head of ChatGLM · TechNode NYT Connections Sports Edition hints and answers for June 1: Tips to solve Connections #251 Huawei Enjoy 70X smartphone to launch on January 3 with Kirin 8000A processor · TechNode Amazon Fire TV soundbar 2.0: $80 off at Woot JD Logistics opens first self Best portable charger deal: Save 50% on the Anker Zolo power bank
0.51s , 9892.8125 kb
Copyright © 2025 Powered by 【????? ?????? ??????】Alibaba Cloud launches open source Large Vision Language Model Qwen,Feature Flash