Jais Adapted 13B by inceptionai

 ยป  All LLMs  ยป  inceptionai  ยป  Jais Adapted 13B   URL Share it on

  Arxiv:2307.09288   Arxiv:2308.16149   Arxiv:2402.12840   Ar   Arabic Base model:finetune:meta-llama... Base model:meta-llama/llama-2-...   Decoder   En   English   Jais-family   Llama   Region:us   Safetensors   Sharded   Tensorflow

Jais Adapted 13B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
๐ŸŒŸ Advertise your project ๐Ÿš€

Jais Adapted 13B Parameters and Internals

Model Type 
LLM, Decoder, causal-lm
Use Cases 
Areas:
Research, Commercial applications
Applications:
Natural language understanding and generation, Mechanistic interpretability, Sentiment analysis, Summarization
Primary Use Cases:
Research purposes for Arabic NLP, Commercial chat applications, Sentiment analysis, Academic research
Limitations:
Prohibited from generating harmful content, Sensitive information handling, Generalization across non-supported languages, High-stakes decision making
Considerations:
Efforts to ensure cultural adaptation and diverse topic range in fine-tuning datasets.
Additional Notes 
Techniques used for Arabic model augmentation applicable to other low-resource languages.
Supported Languages 
Arabic (MSA) (Strong capabilities), English (Strong capabilities)
Training Details 
Data Sources:
Web pages, Wikipedia articles, News articles, Social network content, Code data, Books, Scientific papers, Synthetic data (English to Arabic translations)
Data Volume:
Up to 1.6 Trillion tokens
Methodology:
Documents packed with EOS tokens for pre-training and frozen backbone during adapted pre-training. Instructional fine-tuning for chat models.
Context Length:
16384
Hardware Used:
Condor Galaxy supercomputer, 64 Cerebras CS-2 Wafer-Scale Engines
Model Architecture:
Auto-regressive Transformer-based, decoder-only architecture with support for long context lengths.
Responsible Ai Considerations 
Mitigation Strategies:
Minimized biases; AI assistant role limited to Arabic and English for fine-tuned models.
Input Output 
Input Format:
Text inputs
Accepted Modalities:
text
Output Format:
Generated text
LLM NameJais Adapted 13B
Repository ๐Ÿค—https://huggingface.co/inceptionai/jais-adapted-13b 
Base Model(s)  Llama 2 13B   meta-llama/Llama-2-13b
Model Size13b
Required VRAM53.5 GB
Updated2024-11-04
Maintainerinceptionai
Model Typellama
Model Files  4.8 GB: 1-of-11   4.8 GB: 2-of-11   4.8 GB: 3-of-11   5.0 GB: 4-of-11   5.0 GB: 5-of-11   5.0 GB: 6-of-11   5.0 GB: 7-of-11   4.8 GB: 8-of-11   4.8 GB: 9-of-11   4.8 GB: 10-of-11   4.7 GB: 11-of-11
Supported Languagesar en
Gated ModelYes
Model ArchitectureLlamaForCausalLM
Licenseproprietary
Context Length4096
Model Max Length4096
Transformers Version4.38.2
Tokenizer ClassLlamaTokenizer
Vocabulary Size64000
Torch Data Typefloat32
Jais Adapted 13B (inceptionai/jais-adapted-13b)

Best Alternatives to Jais Adapted 13B

Best Alternatives
Context / RAM
Downloads
Likes
Luminaura RP 13B128K / 26 GB160
Yarn Llama 2 13B 128K128K / 26 GB2208112
Agent Llama2 13B 80K80K / 26.4 GB160
Chat Llama2 13B 80K80K / 52.8 GB160
LongAlign 13B 64K64K / 26 GB2113
LongAlign 13B 64K Base64K / 26 GB303
Openbuddy Llama2 13B V15p1 64K64K / 26.1 GB164
Openbuddy Llama2 13b64k V1564K / 26.1 GB141
Yarn Llama 2 13B 64K64K / 26 GB100217
Airoboros L2 13B 2.1 YaRN 64K64K / 26 GB297

Rank the Jais Adapted 13B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 48046 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124