Llama 3.1 update binaries by martindevans · Pull Request #874 · SciSharp/LLamaSharp

martindevans · 2024-07-28T21:08:53Z

Update to new binaries for llama.cpp 345c8c0c87a97c1595f9c8b14833d531c8c7d8df. Built with this build action. This should include the RoPE fixes for llama 3.1.

Testing:

…hen referring to the C++ - Exposed `KvCacheMaxPosition` on `SafeLLamaContextHandle`

m0nsky · 2024-08-01T17:07:43Z

Tests passed on Linux CUDA.
Built my sample app for Windows & Linux Vulkan, and both are working fine.

Not sure if we need to test Linux/MacOS CPU as this should be covered by CI.

martindevans · 2024-08-02T17:02:02Z

Not sure if we need to test Linux/MacOS CPU as this should be covered by CI.

Does the MacOS CI runner have metal or is it CPU only?

SignalRT · 2024-08-02T18:17:42Z

Not sure if we need to test Linux/MacOS CPU as this should be covered by CI.

Does the MacOS CI runner have metal or is it CPU only?

Only CPU. The metal emulation fails.

SignalRT · 2024-08-02T18:18:19Z

It works on MacOS with metal

SignalRT

Tested

martindevans · 2024-08-02T19:56:31Z

Excellent, thanks for testing that @SignalRT. I'll try to run through the release process tonight or tomorrow.

martindevans · 2024-08-03T13:51:46Z

Resolved conflicts, will merge once CI completes.

martindevans added 2 commits July 28, 2024 16:32

Changes for 345c8c0c87a97c1595f9c8b14833d531c8c7d8df

97c8786

Updated binaries

bae21b5

martindevans mentioned this pull request Jul 29, 2024

Error Loading LLAMA 3.1 llama_model_load: error loading model: done_getting_tensors: wrong number of tensors; expected 292, got 291[BUG]: #875

Closed

- Added note on LLamaAttentionType struct, making it easier to find w…

bb7523e

…hen referring to the C++ - Exposed `KvCacheMaxPosition` on `SafeLLamaContextHandle`

martindevans requested a review from SignalRT July 31, 2024 12:05

neuhaus mentioned this pull request Aug 2, 2024

Bug: llama 3.1 and variants fail with error "wrong number of tensors; expected 292, got 291" mozilla-ai/llamafile#516

Closed

SignalRT approved these changes Aug 2, 2024

View reviewed changes

Merge branch 'master' into llama_3.1_update_binaries

d1dbb21

martindevans merged commit bb2f3ad into SciSharp:master Aug 3, 2024

martindevans deleted the llama_3.1_update_binaries branch August 3, 2024 14:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Llama 3.1 update binaries#874

Llama 3.1 update binaries#874
martindevans merged 4 commits intoSciSharp:masterfrom
martindevans:llama_3.1_update_binaries

martindevans commented Jul 28, 2024 •

edited by SignalRT

Loading

Uh oh!

m0nsky commented Aug 1, 2024

Uh oh!

martindevans commented Aug 2, 2024

Uh oh!

SignalRT commented Aug 2, 2024

Uh oh!

SignalRT commented Aug 2, 2024

Uh oh!

SignalRT left a comment

Uh oh!

martindevans commented Aug 2, 2024

Uh oh!

martindevans commented Aug 3, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

martindevans commented Jul 28, 2024 • edited by SignalRT Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

m0nsky commented Aug 1, 2024

Uh oh!

martindevans commented Aug 2, 2024

Uh oh!

SignalRT commented Aug 2, 2024

Uh oh!

SignalRT commented Aug 2, 2024

Uh oh!

SignalRT left a comment

Choose a reason for hiding this comment

Uh oh!

martindevans commented Aug 2, 2024

Uh oh!

martindevans commented Aug 3, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

martindevans commented Jul 28, 2024 •

edited by SignalRT

Loading