Inference-Optimization

3 articles

All articles 3 total

Knowledge Distillation How teacher-student training compresses large models into smaller, … Glossary

Added 28 Mar · Upd 30 May ·2 min

Pruning How structured and unstructured pruning reduce neural network size by … Glossary

Added 28 Mar · Upd 30 May ·2 min

Quantization How INT8 and INT4 quantization compress neural network models for faster … Glossary

Added 28 Mar · Upd 30 May ·3 min

Open source projects

Freelancer Templates Contracts, proposals, SOWs

Freelancer Automation Workflow recipes, AI playbooks

Work with Linda

Workshop Series €2,000/mo x 3

1:1 Consulting 60 min session