snxraven
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-08-15 01:09:00 -04:00
05e79cba3a fixing up token reducing if sessions too large
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-08-14 23:44:09 -04:00
f91d66b2b3 Adding assistant to Counter
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-08-14 23:04:35 -04:00
6efd069b5d counting tokens properly
snxraven opened issue snxraven/llama-cpp-python-djs-bot#3 2023-07-14 01:12:30 -04:00
OpenCL Testing
snxraven opened issue snxraven/llama-cpp-python-djs-bot#2 2023-07-14 01:11:27 -04:00
LLAMA Token Counter is Incorrect
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-06-12 12:45:44 -04:00
bd435ca311 default to disk cache
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-06-12 12:16:19 -04:00
61ee13bfbd remove comments
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-06-12 12:15:14 -04:00
10473ef702 beginnings of using a tokenizer to check CTX length, remove context if needed
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-31 17:02:10 -04:00
20c83a656a update dockerfile for GPU
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-27 01:35:44 -04:00
61c2fed773 update dockerfile for server-non-gpu
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-26 20:02:42 -04:00
6ed34de804 update readme
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-23 09:53:56 -04:00
6bf6c1ef28 Dont send anything if time==0
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-23 08:56:24 -04:00
7102bf32f0 Fix GPU Cache
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-22 12:38:50 -04:00
099dbf908b update
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-22 12:23:20 -04:00
bc6157e4a1 bug fix
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-22 11:47:01 -04:00
da0650d3b6 resetting botMessage to stop crashing
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-22 11:35:24 -04:00
393999165b adding GPU ENV Var to main compose
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-20 18:20:55 -04:00
c75926728c comments
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-20 17:47:19 -04:00
927b5c834d Add warning about caching with cuBLAS
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-20 09:14:28 -04:00
668b343cbb changing the name scheme for docker-compose