Fixing build and defaulting models.home by cestella · Pull Request #35 · kherud/java-llama.cpp

cestella · 2023-12-19T22:58:25Z

This PR does a few things:

I noticed that master no longer works for me: I get a SIGABRT when trying to load the jllama.cpp library. I tracked it down to the missing LLAMA_NATIVE option, which you removed here. I'm not exactly sure why this gives my Mac Studio fits (I am running an M2 Ultra with 128 GB of RAM on Ventura), but it does.
As per here, I'm defaulting models.home to models both in the integration test run and when executing via mvn exec:java.
Also, as per here, I'm migrating the integration test to the 2-bit quantization so we can use a smaller model.
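The models.home fallback described above can be sketched on the Java side as follows. This is a minimal illustration only: it assumes the property is read with System.getProperty, and the class name, method name, and model file name are hypothetical, not taken from this PR. The actual default may be wired up in the Maven configuration instead.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

class ModelsHome {
    // Assumed default directory, mirroring the "models" default described in the PR.
    static final String DEFAULT_MODELS_HOME = "models";

    // Resolve a model file against models.home, falling back to the default when the property is unset.
    static Path resolveModel(String fileName) {
        String home = System.getProperty("models.home", DEFAULT_MODELS_HOME);
        return Paths.get(home).resolve(fileName);
    }

    public static void main(String[] args) {
        // Hypothetical file name, used only for illustration.
        System.out.println(resolveModel("example-model.gguf"));
    }
}
```

Passing -Dmodels.home=/some/dir on the command line would still override the default, so existing setups keep working.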

cestella · 2023-12-19T23:10:24Z

.gitignore

 hs_err_pid*
 replay_pid*

+models/*.gguf


This makes sure we don't accidentally check in a chonker .gguf file.

cestella · 2023-12-19T23:11:33Z

build-args.cmake

 endif()

+# general
+option(LLAMA_NATIVE "llama: enable -march=native flag" ON)


I'm genuinely not sure why this caused issues. If you have ideas or alternative approaches, do let me know.

cestella · 2023-12-19T23:31:09Z

src/test/java/de/kherud/llama/LlamaModelIT.java

 		Assert.assertTrue(output.matches("[ab]+"));
-		Assert.assertEquals(nPredict, model.encode(output).length);
+		int generated = model.encode(output).length;
+		Assert.assertTrue(generated > 0 && generated <= nPredict);


I suppose the nPredict parameter is an upper bound on the number of generated tokens rather than a guarantee. The count is consistently off by 1 with the 2-bit quantization.
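The relaxed check can be pulled out into a small predicate to make the intent explicit. This is a sketch under the assumption that nPredict is only an upper bound; the class and method names are hypothetical and not part of the library or this PR.

```java
class TokenBudgetCheck {
    // True when at least one token was generated and the count does not exceed nPredict.
    static boolean withinBudget(int generated, int nPredict) {
        return generated > 0 && generated <= nPredict;
    }

    public static void main(String[] args) {
        int nPredict = 10;
        // One token short of the limit, as observed with the 2-bit quantization.
        System.out.println(withinBudget(nPredict - 1, nPredict));
        // Exceeding the limit should still fail the check.
        System.out.println(withinBudget(nPredict + 1, nPredict));
    }
}
```

A strict equality check would reject the off-by-one case, whereas this form still catches empty output and overruns.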

cestella added 3 commits December 19, 2023 17:34

Adding back in llama_native

b7626c2

Creating models directory and automatically defaulting to it.

95599d9

defaulting for exec as well

86cf3a5

cestella added 3 commits December 19, 2023 18:13

fixing formatting

8c979a5

Migrated test to a smaller mistral model

e94a2a1

reverting

aeb38c6

kherud merged commit 4aad7f9 into kherud:master Dec 20, 2023

kherud merged 6 commits into kherud:master from cestella:cstella/fix_build
