Facts About chatml Revealed
The input and output are generally of measurement n_tokens x n_embd: 1 row for every token, Every single the size from the product’s dimension.Every individual quant is in another branch. See underneath for instructions on fetching from distinct branches.At the moment, I like to recommend working with LM Studio for chatting with Hermes 2. It is j