Models available for E2E testing
DistilGPT-2 - Lightweight text generation
| Small Test | 7 tokens |
| Large Test | 128 tokens |
BERT base - Masked language model
| Small Test | 7 tokens |
| Large Test | 128 tokens |
RoBERTa base - Robust BERT variant
| Small Test | 7 tokens |
| Large Test | 128 tokens |
T5 small - Text-to-text transformer
| Small Test | 7 tokens |
| Large Test | 128 tokens |
DistilBERT - Smaller, faster BERT variant
| Small Test | 7 tokens |
| Large Test | 128 tokens |
ALBERT - Parameter-efficient BERT variant
| Small Test | 7 tokens |
| Large Test | 128 tokens |
Sentence-BERT - Text embeddings for semantic search
| Small Test | 16 tokens |
| Large Test | 128 tokens |
ResNet-50 - Image classification (1000 classes)
| Small Test | 64x64x3 |
| Large Test | 224x224x3 |
Vision Transformer (ViT) - Image classification
| Small Test | 64x64x3 |
| Large Test | 224x224x3 |
ConvNeXt Tiny - Modern CNN architecture
| Small Test | 64x64x3 |
| Large Test | 224x224x3 |
MobileNetV2 - Efficient mobile architecture
| Small Test | 64x64x3 |
| Large Test | 224x224x3 |
DeiT Small - Data-efficient Image Transformer
| Small Test | 64x64x3 |
| Large Test | 224x224x3 |
Swin Transformer - Shifted window attention
| Small Test | 64x64x3 |
| Large Test | 224x224x3 |
EfficientNet-B0 - Compound scaling CNN
| Small Test | 64x64x3 |
| Large Test | 224x224x3 |
DETR - End-to-end object detection with transformer
| Small Test | 64x64x3 |
| Large Test | 800x600x3 |
SegFormer - Semantic segmentation
| Small Test | 64x64x3 |
| Large Test | 512x512x3 |
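
To give a rough sense of how much larger the large image tests are than the small ones, the sketch below parses the size strings used in the tables above and counts the input elements. The helper names `parse_shape` and `element_count` are hypothetical and not part of the test suite, and reading the three numbers as two spatial dimensions plus a channel count is an assumption, since the tables do not state the dimension order.

```python
from math import prod


def parse_shape(text):
    """Turn a size string such as '224x224x3' into a tuple of ints."""
    return tuple(int(part) for part in text.lower().split("x"))


def element_count(text):
    """Number of scalar values in an input of the given size."""
    return prod(parse_shape(text))


# Sizes copied from the tables above; the labels are for illustration only.
sizes = {"small": "64x64x3", "large": "224x224x3"}

for label, size in sizes.items():
    print(label, parse_shape(size), element_count(size))
```

Under these assumptions a 224x224x3 input holds 150528 values against 12288 for 64x64x3, which is about 12 times as many.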
CLIP - Image-text matching and zero-shot classification
Text + Image input
Wav2Vec2 - Speech recognition
Large language models in GGUF format. These require the Core GGUF runtime plugin (llama.cpp).
TinyLlama 1.1B - Small but capable chat model
| Format | GGUF |
| Small Test | 32 tokens |
| Large Test | 256 tokens |
Microsoft Phi-2 - 2.7B parameter small language model
| Format | GGUF |
| Small Test | 64 tokens |
| Large Test | 256 tokens |
Qwen2 0.5B - Ultra-small instruction-tuned model (Alibaba)
| Format | GGUF |
| Small Test | 32 tokens |
| Large Test | 128 tokens |
Meta Llama 3.2 1B - Latest small model optimized for mobile
| Format | GGUF |
| Small Test | 32 tokens |
| Large Test | 256 tokens |
Meta Llama 3.2 3B - Excellent quality/size ratio
| Format | GGUF |
| Small Test | 64 tokens |
| Large Test | 256 tokens |
DeepSeek Coder 1.3B - Code generation specialist
| Format | GGUF |
| Small Test | 64 tokens |
| Large Test | 256 tokens |
DeepSeek LLM 7B Chat - High-quality open model
| Format | GGUF |
| Small Test | 64 tokens |
| Large Test | 256 tokens |
To add a model:

1. Add an entry to `config/models.yaml`.
2. Set `enabled: true` to include it in tests.
3. Run `make config` to verify.
4. Run `make test` to test.

Example entry:

    my_model:
      enabled: true
      category: nlp
      axon_id: "hf/my-org/my-model@latest"
      description: "My awesome model"
      input_type: text
      small_input:
        tokens: 7
      large_input:
        tokens: 128
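
Before running `make config`, it can help to sanity-check a new entry by hand. The following is a minimal sketch, not the project's own validation: `validate_entry`, `enabled_models`, and `REQUIRED_KEYS` are hypothetical names, and the list of required keys is inferred only from the example entry, so the real schema may require or allow other fields. The entry is written as the dictionary a YAML parser would produce, to keep the sketch free of third-party dependencies.

```python
# Assumption: these keys are taken from the example entry only;
# the actual test harness may accept or require other fields.
REQUIRED_KEYS = {
    "enabled", "category", "axon_id", "description",
    "input_type", "small_input", "large_input",
}


def validate_entry(name, entry):
    """Return a list of problems found in one model entry (empty if none)."""
    problems = []
    missing = sorted(REQUIRED_KEYS - set(entry))
    if missing:
        problems.append(f"{name}: missing keys {missing}")
    if not isinstance(entry.get("enabled"), bool):
        problems.append(f"{name}: 'enabled' should be true or false")
    for size in ("small_input", "large_input"):
        value = entry.get(size)
        if not isinstance(value, dict) or not value:
            problems.append(f"{name}: '{size}' should be a non-empty mapping")
    return problems


def enabled_models(config):
    """Names of the entries that would be included in tests."""
    return sorted(n for n, e in config.items() if e.get("enabled") is True)


# The example entry, as a parsed mapping.
config = {
    "my_model": {
        "enabled": True,
        "category": "nlp",
        "axon_id": "hf/my-org/my-model@latest",
        "description": "My awesome model",
        "input_type": "text",
        "small_input": {"tokens": 7},
        "large_input": {"tokens": 128},
    }
}

for model_name, model_entry in config.items():
    for problem in validate_entry(model_name, model_entry):
        print(problem)
print("enabled:", enabled_models(config))
```

For the example entry the validator reports no problems and `my_model` is listed as enabled. An entry with `enabled: false` would pass validation but be left out of the enabled list.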