Skip to content

Fixing build and defaulting models.home#35

Merged
kherud merged 6 commits intokherud:masterfrom
cestella:cstella/fix_build
Dec 20, 2023
Merged

Fixing build and defaulting models.home#35
kherud merged 6 commits intokherud:masterfrom
cestella:cstella/fix_build

Conversation

@cestella
Copy link
Contributor

@cestella cestella commented Dec 19, 2023

This PR does a couple of things:

  • I noticed that master no longer builds for me (I get a SIGABRT when trying to load the jllama.cpp library). I tracked it down to the lack of LLAMA_NATIVE which you removed here. I'm not exactly sure why this is giving my Mac Studio fits (I am running a M2 Ultra with 128G of RAM on Ventura), but it is.
  • As per here I'm defaulting models.home to models in both the integration test run as well as via mvn exec:java execution.
  • Also, as per here I'm migrating the integration test to the 2-big quantization so we can use a smaller model.

hs_err_pid*
replay_pid*

models/*.gguf
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This makes sure we don't accidentally check in a chonker .gguf file.

endif()

# general
option(LLAMA_NATIVE "llama: enable -march=native flag" ON)
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm genuinely not sure why this caused issues. If you have ideas or alternative approaches, do let me know.

Assert.assertTrue(output.matches("[ab]+"));
Assert.assertEquals(nPredict, model.encode(output).length);
int generated = model.encode(output).length;
Assert.assertTrue(generated > 0 && generated <= nPredict);
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I suppose the number of tokens generated parameter is an upper bound rather than a guarantee. It's consistently off by 1 in the 2 bit quantization.

@kherud kherud merged commit 4aad7f9 into kherud:master Dec 20, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants