OpenBA-V2: a 3B LLM (Large Language Model) with a T5 architecture, obtained by pruning OpenBA-15B and continuing its pretraining.
Updated May 10, 2024 - Python
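The OpenBA-V2 entry describes pruning a 15B model down to 3B and then continuing pretraining. Its actual recipe is not reproduced here; as an illustration of the general pattern only, the sketch below magnitude-prunes the linear layers of a stand-in T5 checkpoint (`t5-small`, an assumption, not the OpenBA weights) using PyTorch's built-in pruning utilities, after which continued pretraining would let the smaller model recover quality.

```python
# Illustrative sketch only -- NOT OpenBA-V2's actual pruning recipe.
# Shows the generic prune-then-continue-pretraining pattern on a T5-style model.
import torch.nn as nn
import torch.nn.utils.prune as prune
from transformers import T5ForConditionalGeneration

model = T5ForConditionalGeneration.from_pretrained("t5-small")  # stand-in checkpoint

# Magnitude-prune 30% of the weights in every linear projection.
for module in model.modules():
    if isinstance(module, nn.Linear):
        prune.l1_unstructured(module, name="weight", amount=0.3)
        prune.remove(module, "weight")  # bake the zeroed weights in permanently

# Continued pretraining would follow here: train the pruned model on more
# pretraining data at a reduced learning rate so it recovers lost capacity.
```

Note that unstructured magnitude pruning as above only zeroes individual weights; shrinking a model to a smaller parameter count, as OpenBA-V2 does, requires structured pruning that removes whole layers, heads, or hidden dimensions.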
PyTorch implementation of several CNNs for image classification.
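As a companion to the CNN entry above, here is a minimal, self-contained PyTorch image classifier; the architecture, layer sizes, and input shape are illustrative placeholders, not taken from that repository.

```python
# Minimal CNN classifier sketch (hypothetical architecture, not from the repo).
import torch
import torch.nn as nn

class SmallCNN(nn.Module):
    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),  # 32x32 -> 16x16
            nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),  # 16x16 -> 8x8
        )
        self.classifier = nn.Linear(64 * 8 * 8, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.features(x)
        return self.classifier(x.flatten(1))

logits = SmallCNN()(torch.randn(4, 3, 32, 32))  # -> shape (4, 10)
```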
Experimental GPT-2-scale (~124M parameter) LLM trained from scratch on Google Colab, using C4 and a Cosmopedia/Alpaca/Python mix. Includes the full training pipeline, a mixed-dataset loader with Colab-resilient checkpointing, log-analysis tools, and an honest write-up of what went wrong.
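The entry above mentions "Colab-resilient checkpointing." That repo's actual helpers are unknown; the sketch below shows one common way to make training survive Colab preemptions: persist model, optimizer, and step state to durable storage (e.g., a mounted Drive path) and resume from it on restart. `CKPT_PATH`, `save_checkpoint`, and `load_checkpoint` are hypothetical names.

```python
# Hedged sketch of session-resilient checkpointing (names are hypothetical).
import os
import torch

CKPT_PATH = "checkpoint.pt"  # in practice, a path under a mounted Google Drive

def save_checkpoint(model, optimizer, step):
    # Called periodically during training so a killed session loses little work.
    torch.save(
        {"model": model.state_dict(),
         "optimizer": optimizer.state_dict(),
         "step": step},
        CKPT_PATH,
    )

def load_checkpoint(model, optimizer):
    # Called once at startup; returns the step to resume from.
    if not os.path.exists(CKPT_PATH):
        return 0  # fresh run
    state = torch.load(CKPT_PATH, map_location="cpu")
    model.load_state_dict(state["model"])
    optimizer.load_state_dict(state["optimizer"])
    return state["step"]
```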