When vectors contain negative values, the cosine similarity appears to be calculated incorrectly. The returned distance is often greater than 1 or less than -1 if the vector contains negative values. The vectors we use have 768 dimensions.
I'm using the KDB.AI cloud version. The most extreme values in the vectors are negative values below -30e-3; there are no major outliers, since the vectors are DistilBERT output. The distances look off: when I calculate them manually, it seems the result is not scaled by the vector norms (dot product and cosine similarity give me similar distances). The distance is as high as 60.
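
For reference, here is a minimal sketch of the manual check I mean, in plain Python. The vectors `a` and `b` are random stand-ins, not my real embeddings, and the function names are my own; nothing here calls the KDB.AI client. It only shows that a correctly scaled cosine similarity stays within [-1, 1] (up to floating-point rounding) even when the vectors contain negative values, while the raw dot product is unbounded.

```python
import math
import random


def dot_product(a, b):
    """Unscaled inner product; its magnitude grows with the vector norms."""
    return sum(x * y for x, y in zip(a, b))


def cosine_similarity(a, b):
    """Dot product divided by both Euclidean norms."""
    norm_a = math.sqrt(dot_product(a, a))
    norm_b = math.sqrt(dot_product(b, b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot_product(a, b) / (norm_a * norm_b)


# Hypothetical stand-ins for two 768-dimensional embeddings.
# Gaussian samples include plenty of negative values.
random.seed(0)
a = [random.gauss(0.0, 1.0) for _ in range(768)]
b = [random.gauss(0.0, 1.0) for _ in range(768)]

dot = dot_product(a, b)
cos = cosine_similarity(a, b)

print(f"dot product:       {dot:.4f}")
print(f"cosine similarity: {cos:.4f}")
print(f"cosine distance:   {1.0 - cos:.4f}")  # 1 - similarity, range [0, 2]
```

If the value returned for a cosine metric matches `dot` rather than `cos` on the same pair of vectors, that would suggest the division by the norms is being skipped. As a possible workaround (an assumption on my side, not something I have confirmed with KDB.AI), normalizing every vector to unit length before inserting it would make the dot product equal to the cosine similarity.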