Tags
Tags give the ability to mark specific points in history as being important.
b2817 · 83330d8c · main : add --conversation / -cnv flag (#7108) · May 08, 2024
b2816 · 465263d0 · sgemm : AVX Q4_0 and Q8_0 (#6891) · May 08, 2024
b2815 · 911b3900 · server : add_special option for tokenize endpoint (#7059) · May 08, 2024
b2813 · 229ffff8 · llama : add BPE pre-tokenization for Qwen2 (#7114) · May 08, 2024
b2812 · 1fd9c174 · clean up json_value & server_log (#7142) · May 08, 2024
b2811 · 4cd621c2 · convert : add BPE pre-tokenization for DBRX (#7132) · May 08, 2024
b2808 · 38554160 · ggml : introduce bfloat16 support (#6412) · May 08, 2024
b2805 · 48b2f9c1 · Fixed save_imatrix to match old behaviour for MoE (#7099) · May 08, 2024
b2804 · af0a5b61 · server: fix incorrectly reported token probabilities (#7125) · May 07, 2024
b2803 · b6aa6702 · Fix OLMo HF to GGUF conversion (#6910) · May 07, 2024
b2800 · 3af34c1d · main : update log text (EOS to EOG) (#7104) · May 07, 2024
b2797 · 858f6b73 · Add an option to build without CUDA VMM (#7067) · May 06, 2024
b2794 · 628b2991 · Adding support for the --numa argument for llama-bench. (#7080) · May 05, 2024
b2793 · 8f8acc86 · Disable benchmark on forked repo (#7034) · May 05, 2024
b2791 · 889bdd76 · command-r : add BPE pre-tokenization (#7063) · May 05, 2024
b2789 · 84250014 · gguf-split: add --no-tensor-first-split (#7072) · May 04, 2024
b2787 · fcd84a0f · Fix Linux /sys cpu path to guess number of cores (#7064) · May 04, 2024
b2785 · 92139b90 · tests : add test-tokenizer-0.sh + fix some tokenizers (#7036) · May 04, 2024
b2784 · a2ac89d6 · convert.py : add python logging instead of print() (#6511) · May 03, 2024
b2783 · 433def28 · llama : rename ctx to user_data in progress_callback (#7045) · May 03, 2024