Skip to content
GitLab
Explore
Sign in
Tags
Tags give the ability to mark specific points in history as being important
master-f0d70f1
f0d70f14
·
Various fixes to mat_mul benchmark (#1253)
·
Apr 30, 2023
master-3e5aa8a
3e5aa8a1
·
ggml : fix labels for GGML_OP_ALIBI
·
Apr 30, 2023
master-c3ca7a5
c3ca7a5f
·
ggml : fix 32-bit ARM NEON
·
Apr 29, 2023
master-e8c0516
e8c05161
·
ggml : use vzip instead of vuzp for consistency
·
Apr 29, 2023
master-0b5a935
0b5a9350
·
ggml : fix visibility and unused warnings
·
Apr 29, 2023
master-ec728e4
ec728e44
·
ggml : fix #if for f32_f32 mul_mat (CLBlast) (#1229)
·
Apr 29, 2023
master-214b6a3
214b6a35
·
ggml : adjust mul_mat_f16 work memory (#1226)
·
Apr 29, 2023
master-305eb5a
305eb5af
·
build : fix reference to old llama_util.h
·
Apr 29, 2023
master-334637e
334637e4
·
common : change default parameters to pre-#1126 (#1223)
·
Apr 29, 2023
master-dd7eff5
dd7eff57
·
llama : new sampling algorithms (#1126)
·
Apr 29, 2023
master-7fc50c0
7fc50c05
·
cuBLAS: use host pinned memory and dequantize while copying (#1207)
·
Apr 29, 2023
master-b1ee8f5
b1ee8f59
·
cuBLAS: non-contiguous tensor support (#1215)
·
Apr 29, 2023
master-36d19a6
36d19a60
·
Remove Q4_3 which is no better than Q5 (#1218)
·
Apr 28, 2023
master-55390bc
55390bca
·
ggml : sync ggml (ggml_alibi)
·
Apr 28, 2023
master-1481a9c
1481a9cf
·
llama : add session file format and saved sessions in main (#1169)
·
Apr 28, 2023
master-11d9023
11d90236
·
ggml : add helper debug printf in soft_max
·
Apr 28, 2023
master-7296c96
7296c961
·
ggml : add CLBlast support (#1164)
·
Apr 28, 2023
master-92a6e13
92a6e13a
·
Add Manjaro CUDA include and lib dirs to Makefile (#1212)
·
Apr 28, 2023
master-04aaae1
04aaae1d
·
add avx2 for dot_q8_0_q8_0, 2x faster than scalar (#1211)
·
Apr 28, 2023
master-0b2da20
0b2da205
·
ggml : slightly faster AVX2 implementation for Q5 (#1197)
·
Apr 26, 2023
Prev
1
…
83
84
85
86
87
88
89
90
91
…
99
Next