VishwamAI

VishwamAI is a sophisticated machine learning library focusing on efficient model quantization, advanced tokenization, and mathematical reasoning capabilities.

Features

Advanced Tokenization: Conceptual tokenizer with semantic clustering and special token handling
Efficient Quantization: Support for FP8 and BF16 quantization
Mathematical Reasoning: Integration with GSM8K dataset for advanced mathematical problem-solving
Model Architecture: Flexible transformer-based architecture with configurable parameters
Training Utilities: Support for distributed training, mixed precision, and gradient accumulation

Installation

pip install -e .

Quick Start

from vishwamai.model import VishwamaiModel
from vishwamai.conceptual_tokenizer import ConceptualTokenizer

# Initialize tokenizer and model
tokenizer = ConceptualTokenizer()
model = VishwamaiModel()

# Example usage
text = "Solve: If John has 5 apples and gives 2 to Mary, how many does he have left?"
tokens = tokenizer.encode(text)
output = model.generate(tokens)

Testing

Run the test suite:

pytest -v

Requirements

Python >= 3.8
PyTorch >= 2.1.0
CUDA toolkit (for GPU support)
Additional dependencies listed in setup.py

Project Structure

vishwamai/
├── conceptual_tokenizer.py   # Advanced tokenization implementation
├── kernel.py                 # CUDA kernels and quantization
├── model.py                 # Core model architecture
├── training.py              # Training utilities
└── configs/                 # Model configurations

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contributing

Contributions are welcome! Please read our contributing guidelines before submitting pull requests.

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
.github/workflows		.github/workflows
.vscode		.vscode
Research		Research
math		math
tests		tests
vishwamai		vishwamai
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
conceptual.model		conceptual.model
conceptual.vocab		conceptual.vocab
pytest.ini		pytest.ini
requirements.txt		requirements.txt
setup.py		setup.py
test_tokenizer.model		test_tokenizer.model
test_tokenizer.vocab		test_tokenizer.vocab
train.py		train.py
train_math.py		train_math.py
training_data.txt		training_data.txt
vishwamai_math_integration.ipynb		vishwamai_math_integration.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VishwamAI

Features

Installation

Quick Start

Testing

Requirements

Project Structure

License

Contributing

About

Releases

Packages

Languages

License

VishwamAI/VishwamAI

Folders and files

Latest commit

History

Repository files navigation

VishwamAI

Features

Installation

Quick Start

Testing

Requirements

Project Structure

License

Contributing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages