changelog : `libllama` API #9289

ggerganov · 2024-09-03T06:48:45Z

Overview

This is a list of changes to the public interface of the llama library. Collaborators are encouraged to edit this post in order to reflect important changes to the API that end up merged into the master branch.

If you are building a 3rd party project that relies on libllama, it is recommended to follow this issue and check it before upgrading to new versions.

Recent API changes (most recent at the top)

version	PR	desc
TBD.	#9897	Deprecate `softmax` sampler and update `dist` sampler`
b3988	#10071	Remove Tail-Free sampling
b3943	#9745	Removed `all_pos_0, all_pos_1, all_seq_id` from `llama_batch`
b3908	#9798	Update FIM-related API
b3841	#9510	Add `LLAMA_POOLING_TYPE_RANK`
b3774	#9512	Add `llama_n_head()`
b3750	#9355	Add `llama_perf` API + param to disable internal profiling
b3749	#9445	Add `llama_sampler_chain_remove()`
b3681	#9294	Major changes to the sampling API (see PR for more info)
b3651	#8980	Add `LLAMA_VOCAB_TYPE_RWKV` enum value
b3644	#8672	Add `llama_threadpool` API + change `uint32_t` -> `int32_t`
b3614	#8526	Add `llama_model_is_recurrent`

For older changes, use:

git log --oneline -p b3614 -- include/llama.h

Upcoming API changes

naming : normalize the name of callback-related identifiers #9405
Accept pointers to params Library Interface Considerations for FFI (passing plain structs) #9172

The text was updated successfully, but these errors were encountered:

ggerganov · 2024-09-13T06:57:14Z

#9355 restores the functionality for getting performance measurements from within libllama (which was removed in #9294) via a new llama_perf API. The llama_context_params is extended with a new bool no_perf parameter that can be used to disable the internal timings during libllama compute.

ggerganov added the documentation Improvements or additions to documentation label Sep 3, 2024

ggerganov pinned this issue Sep 3, 2024

ggerganov mentioned this issue Sep 3, 2024

changelog : llama-server REST API #9291

Open

ngxson mentioned this issue Oct 31, 2024

llama : remove Tail-Free sampling #10071

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

changelog : `libllama` API #9289

changelog : `libllama` API #9289

ggerganov commented Sep 3, 2024 •

edited by ngxson

Loading

ggerganov commented Sep 13, 2024

changelog : libllama API #9289

changelog : libllama API #9289

Comments

ggerganov commented Sep 3, 2024 • edited by ngxson Loading

Overview

Recent API changes (most recent at the top)

Upcoming API changes

ggerganov commented Sep 13, 2024

changelog : `libllama` API #9289

changelog : `libllama` API #9289

ggerganov commented Sep 3, 2024 •

edited by ngxson

Loading