Usage terms and licensing conditions for Token Haven's premium multilingual LLM training datasets.
Last updated: August 9, 2025
This Dataset License Agreement ("License") governs your use of the high-quality, curated multilingual datasets provided by Token Haven. Our datasets include Spanish, Arabic, and Norwegian language data specifically prepared for training large language models.
Dataset Coverage:
Use datasets for academic research, language model development, and AI research projects.
Train commercial language models and AI systems using our curated datasets.
Fine-tune existing language models for improved multilingual performance.
Use in educational institutions for teaching and learning about NLP and machine learning.
You may not redistribute, resell, or share the datasets with third parties without explicit written permission.
Datasets may not be used for creating harmful, discriminatory, or illegal content or systems.
Token Haven provides datasets that have been carefully curated, deduplicated, and formatted for optimal training performance. However, we do not guarantee that the datasets are completely error-free or suitable for every specific use case.
The datasets are provided "as is" without warranties of any kind. Token Haven disclaims all warranties, whether express or implied, including but not limited to warranties of merchantability, fitness for a particular purpose, and non-infringement.
Subject to the terms and conditions of this License, Token Haven grants you a non-exclusive, non-transferable license to use the datasets for the permitted purposes outlined above.
This License is effective upon your download or use of the datasets and continues until terminated. We may terminate this License immediately if you breach any of its terms.
This License shall be governed by and construed in accordance with the laws of the jurisdiction where Token Haven operates.
If you have questions about this Dataset License Agreement or need clarification about permitted uses, please contact us:
Token Haven Licensing Department
For licensing inquiries, commercial partnerships, or technical support regarding our datasets.