Todo
#thecompaniesapi quantize Phi-3.5 to 4-bit to use it in our inference server; roughly the same on-disk size as the current model, but a 128k context window instead of 4k, so I can now process huge chunks of text without relying on batching
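A possible workflow for the step above, assuming the llama.cpp toolchain and the `microsoft/Phi-3.5-mini-instruct` weights (the exact model, paths, and quant type here are my assumptions, not confirmed by the post):

```shell
# Sketch: convert HF weights to GGUF, then quantize to 4-bit (Q4_K_M).
# Assumes llama.cpp is cloned/built and the model was downloaded locally.
python convert_hf_to_gguf.py ./Phi-3.5-mini-instruct --outfile phi35-f16.gguf
./llama-quantize phi35-f16.gguf phi35-q4_k_m.gguf Q4_K_M
```

Q4_K_M is a common 4-bit choice that keeps quality close to f16 while cutting the file to roughly a quarter of its size; the long context still costs KV-cache memory at inference time, so serving 128k tokens needs headroom beyond the weights themselves.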