Llama 3.2-FFM
The only one to support Llama 3.2 Traditional Chinese multimodal model! Based on Llama 3.2, it enhances Traditional Chinese capabilities, significantly improving image recognition (OCR), image understanding, and question answering abilities, making it the best choice for using Traditional Chinese multimodal models!
Llama 3.2-FFM
★ The only model supporting Llama 3.2 Traditional Chinese multimodal model ★
★ Version 11B is available; version 90B is coming soon. ★
★ Enhanced Traditional Chinese capabilities, enabling accurate OCR output of images in Traditional Chinese ★
★ Improve image comprehension skills, accurately identify and understand image content ★
★ Improved question-and-answer capabilities, providing precise answers to questions ★
★ Outperforms native models in multi-modal model evaluations ★
* Applicable image formats: PNG/JPG/JPEG/WebP/GIF (static)
Llama 3.2-FFM offers a significantly improved multimodal Traditional Chinese experience!
★Excellent Traditional Chinese recognition capability★
Q: Please describe the content of the image.
Llama 3.2-FFM-11B: Correctly recognizes and outputs Traditional Chinese characters.
This image is a New Year's illustration themed around 2025. The background is red, with a red firework in the upper right corner, its tail extending to the center. In the upper left corner is an orange circle containing the white number "2025." To the right of the number is a white snake with a white body, a red dot on its head, black eyes, and a smiling mouth. On the snake's body is an orange flower with a white petal in its center, and a black dot in the center of the petal.
On the right side of the image is text in white and red that reads: "A 9-day holiday – not just for relaxation, but also to broaden your horizons! The 2025 New Year holiday is a full 9 days! How can you just stay at home? Follow New Charm Travel and make this holiday your most exciting adventure, creating 9 days full of memories and stories!"
Llama 3.2-11B native model: Easily answers in English but performs poorly in Chinese recognition.
The image is a vibrant red and yellow graphic with a white snake, fireworks, and Chinese writing. The overall design suggests that it is a celebratory image for the Chinese New Year.
* **White Snake**
* The snake is positioned on the left side of the image, facing right.
* It has a large body with a long tail and a small head.
* The snake's body is adorned with gray spots.
* It has blue legs and a pink nose….
★Accurate Traditional Chinese OCR Output★
Q: List the title and text of the 4th itinerary.
Llama 3.2-FFM-11B: Correctly identifies numbers and Chinese characters.
The title of the fourth itinerary is "Gupo Island Landing, Scenic Views, and Dinner." The text reads as follows: Let's embark together on the journey to the largest uninhabited island in the Beihai area, stand at the highest point overlooking the entire island, and race against time to witness the fleeting beauty of the sunset. After nightfall, we'll go and unveil the mysteries of the Niu Po (a local deity). Afterwards, we'll return to the dining area, look up at the starry sky, and enjoy delicious food on the table while a gentle sea breeze blows. What more could one ask for?
Llama 3.2-11B native model: identifies numerical errors and has many Chinese typos.
The title of the fourth itinerary is "Night Rise at Xiaoguan and Watching the Lights at Sea," and the content is as follows: The night rise at Xiaoguan is divided into two places. If I approach, and my friend is nearby, here you will see, starting from the Fish Sea, as numerous as the stars in the sky. Here you will see, starting from the Fish Sea, as numerous as the stars in the sky. The reason is simple, therefore simple, so the rising seawater spreads and spreads here. Although the rising seawater spreads and spreads here, although the rising seawater spreads and spreads here, although the rising seawater spreads and spreads here…
★More accurate question answering using the model★
Q: What activities are included in Shin Kong Mitsukoshi Night? And when is it held?
Llama 3.2-FFM-11B: Provides precise answers to questions.
June 27th is Shin Kong Night, and the event will feature fireworks and a drone show.
Llama 3.2-11B native model: verbose and prone to repetitive answers.