TNG Tech Models
Explore the TNG Tech language models available through our OpenAI Assistants API-compatible service.
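Because the service exposes an OpenAI-compatible surface, the official OpenAI Python SDK can target it by overriding the client's base URL. The sketch below shows a basic chat completion; the endpoint, environment-variable name, and model slug are illustrative assumptions, not confirmed identifiers.

```python
import os

from openai import OpenAI

# Placeholder endpoint and credential -- substitute the values from
# your account dashboard.
client = OpenAI(
    base_url="https://api.example.com/v1",
    api_key=os.environ["EXAMPLE_API_KEY"],
)

response = client.chat.completions.create(
    # Assumed slug; see the model cards below for the exact identifier.
    model="tngtech/deepseek-r1t2-chimera",
    messages=[
        {"role": "user", "content": "Explain an Assembly-of-Experts merge in two sentences."},
    ],
)
print(response.choices[0].message.content)
```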
TNG: DeepSeek R1T2 Chimera (free)
- Context Length:
- 163,840 tokens
- Architecture:
- text->text
- Pricing:
- Free
DeepSeek-TNG-R1T2-Chimera is the second-generation Chimera model from TNG Tech. It is a 671B-parameter mixture-of-experts text-generation model assembled from DeepSeek-AI's R1-0528, R1, and V3-0324 checkpoints with an Assembly-of-Experts merge. The tri-parent design yields strong reasoning performance while running roughly 20% faster than the original R1 and more than 2× faster than R1-0528 under vLLM, giving a favorable cost-to-intelligence trade-off. The checkpoint supports contexts up to 60k tokens in standard use (tested to ~130k) and maintains consistent `<think>` token behaviour, making it suitable for long-context analysis, dialogue, and other open-ended generation tasks.
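For long reasoning traces it is often preferable to stream tokens as they arrive rather than wait for the full completion. A minimal streaming sketch against the free variant, assuming the same placeholder endpoint and a `:free` slug suffix:

```python
import os

from openai import OpenAI

client = OpenAI(
    base_url="https://api.example.com/v1",  # placeholder endpoint
    api_key=os.environ["EXAMPLE_API_KEY"],
)

# stream=True yields chunks as the model generates them.
stream = client.chat.completions.create(
    model="tngtech/deepseek-r1t2-chimera:free",  # assumed slug
    messages=[{"role": "user", "content": "Prove that the square root of 2 is irrational."}],
    stream=True,
)
for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)
```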
TNG: DeepSeek R1T2 Chimera
- Context Length:
- 163,840 tokens
- Architecture:
- text->text
- Max Output:
- 163,840 tokens
DeepSeek-TNG-R1T2-Chimera is the second-generation Chimera model from TNG Tech. It is a 671B-parameter mixture-of-experts text-generation model assembled from DeepSeek-AI's R1-0528, R1, and V3-0324 checkpoints with an Assembly-of-Experts merge. The tri-parent design yields strong reasoning performance while running roughly 20% faster than the original R1 and more than 2× faster than R1-0528 under vLLM, giving a favorable cost-to-intelligence trade-off. The checkpoint supports contexts up to 60k tokens in standard use (tested to ~130k) and maintains consistent `<think>` token behaviour, making it suitable for long-context analysis, dialogue, and other open-ended generation tasks.
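With a maximum output as large as the context window, unbounded generations can get expensive; capping `max_tokens` per request keeps costs predictable. A sketch under the same placeholder assumptions:

```python
import os

from openai import OpenAI

client = OpenAI(
    base_url="https://api.example.com/v1",  # placeholder endpoint
    api_key=os.environ["EXAMPLE_API_KEY"],
)

response = client.chat.completions.create(
    model="tngtech/deepseek-r1t2-chimera",  # assumed slug for the paid variant
    messages=[{"role": "user", "content": "Outline a design document for a distributed rate limiter."}],
    max_tokens=4096,  # cap well below the 163,840-token ceiling
    temperature=0.6,
)
print(response.choices[0].message.content)
```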
TNG: DeepSeek R1T Chimera (free)
- Context Length:
- 163,840 tokens
- Architecture:
- text->text
- Pricing:
- Free
DeepSeek-R1T-Chimera is created by merging DeepSeek-R1 and DeepSeek-V3 (0324), combining the reasoning capabilities of R1 with the token efficiency improvements of V3. It is based on a DeepSeek-MoE Transformer architecture and is optimized for general text generation tasks.
The model merges pretrained weights from both source models to balance performance across reasoning, efficiency, and instruction-following tasks. It is released under the MIT license and intended for research and commercial use.
TNG: DeepSeek R1T Chimera
- Context Length:
- 163,840 tokens
- Architecture:
- text->text
- Max Output:
- 163,840 tokens
DeepSeek-R1T-Chimera is created by merging DeepSeek-R1 and DeepSeek-V3 (0324), combining the reasoning capabilities of R1 with the token efficiency improvements of V3. It is based on a DeepSeek-MoE Transformer architecture and is optimized for general text generation tasks.
The model merges pretrained weights from both source models to balance performance across reasoning, efficiency, and instruction-following tasks. It is released under the MIT license and intended for research and commercial use.
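Exact model identifiers can vary between deployments, so it is worth querying the standard `/v1/models` endpoint before hard-coding a slug. A discovery sketch with the same placeholder endpoint:

```python
import os

from openai import OpenAI

client = OpenAI(
    base_url="https://api.example.com/v1",  # placeholder endpoint
    api_key=os.environ["EXAMPLE_API_KEY"],
)

# List the models this deployment serves and filter for the Chimera entries.
for model in client.models.list():
    if "chimera" in model.id.lower():
        print(model.id)
```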
Ready to build with TNG Tech?
Start using these powerful models in your applications with our flexible pricing plans.