We have hosted the application tencent hunyuan large in order to run this application in our online workstations with Wine or directly.


Quick description about tencent hunyuan large:

Tencent-Hunyuan-Large is the flagship open-source large language model family from Tencent Hunyuan, offering both pre-trained and instruct (fine-tuned) variants. It is designed with long-context capabilities, quantization support, and high performance on benchmarks across general reasoning, mathematics, language understanding, and Chinese / multilingual tasks. It aims to provide competitive capability with efficient deployment and inference. FP8 quantization support to reduce memory usage (~50%) while maintaining precision. High benchmarking performance on tasks like MMLU, MATH, CMMLU, C-Eval, etc.

Features:
  • Long context window support: up to 256K tokens in pretrain; 128K tokens for instruct models
  • FP8 quantization support to reduce memory usage (~50%) while maintaining precision
  • Expert-specific learning rate scaling in training (for mixture or expert architectures)
  • High benchmarking performance on tasks like MMLU, MATH, CMMLU, C-Eval, etc.
  • Hugging Face format compatibility for fine-tuning / inference using frameworks like hf-deepspeed, plus support for flash attention, efficient operators (TRT-LLM)
  • Throughput and efficiency improvements: TRT-LLM backend surpasses vLLM by ~30 %, quantized/inference optimizations included


Programming Language: Python.
Categories:
Large Language Models (LLM), AI Models

Page navigation:

©2024. Winfy. All Rights Reserved.

By OD Group OU – Registry code: 1609791 -VAT number: EE102345621.