snxraven
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-08-15 05:09:00 +00:00
05e79cba3a fixing up token reducing if sessions too large
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-08-15 03:44:09 +00:00
f91d66b2b3 Adding assistant to Counter
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-08-15 03:04:35 +00:00
6efd069b5d counting tokens properly
snxraven opened issue snxraven/llama-cpp-python-djs-bot#3 2023-07-14 05:12:30 +00:00
OpenCL Testing
snxraven opened issue snxraven/llama-cpp-python-djs-bot#2 2023-07-14 05:11:27 +00:00
LLAMA Token Counter is Incorrect
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-06-12 16:45:44 +00:00
bd435ca311 default to disk cache
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-06-12 16:16:19 +00:00
61ee13bfbd remove comments
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-06-12 16:15:14 +00:00
10473ef702 beginnings of using a tokenizer to check CTX length, remove context if needed
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-31 21:02:10 +00:00
20c83a656a update dockerfile for GPU
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-27 05:35:44 +00:00
61c2fed773 update dockerfile for server-non-gpu
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-27 00:02:42 +00:00
6ed34de804 update readme
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-23 13:53:56 +00:00
6bf6c1ef28 Dont send anything if time==0
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-23 12:56:24 +00:00
7102bf32f0 Fix GPU Cache
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-22 16:38:50 +00:00
099dbf908b update
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-22 16:23:20 +00:00
bc6157e4a1 bug fix
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-22 15:47:01 +00:00
da0650d3b6 resetting botMessage to stop crashing
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-22 15:35:24 +00:00
393999165b adding GPU ENV Var to main compose
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-20 22:20:55 +00:00
c75926728c comments
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-20 21:47:19 +00:00
927b5c834d Add warning about caching with cuBLAS
snxraven pushed to main at snxraven/llama-cpp-python-djs-bot 2023-05-20 13:14:28 +00:00
668b343cbb changing the name scheme for docker-compose