Microsoft’s BitNet shows what AI can do with just 400MB and no GPU

Microsoft’s BitNet shows what AI can do with just 400MB and no GPU

What just happened? Microsoft has introduced BitNet b1.58 2B4T, a new type of large language model engineered for exceptional efficiency. Unlike conventional AI models that rely on 16- or 32-bit floating-point numbers to represent each weight, BitNet uses only three discrete values: -1, 0, or +1. This approach, known as ternary quantization, allows each weight to be stored in just 1.58 bits. The result is a model that dramatically reduces memory usage and can run far more easily on…

Article Source
https://www.techspot.com/news/107617-microsoft-bitnet-shows-what-ai-can-do-400mb.html