Newest 'embedding' Questions

0 votes

0 answers

43 views

Low cosine similarity score when using LangChain FAISS wrapper and CLIP model

I am currently using the LangChain wrapper for the FAISS vector store together with a CLIP model. Since the LangChain FAISS wrapper supports only text embedding models, I have implemented my own embedding class (using ...

Aashishkumar

1

asked Apr 7 at 19:05

Tooling

0 votes

2 replies

60 views

Speaker identification / IVR detection in Indian telephony calls, along with real-time speaker count

I’m working on a Python pipeline for Indian telephony call recordings (mostly 8 kHz mono, noisy, multilingual, and with mixed accents). I need to: estimate how many speakers are in the call; run speaker ...

Nandini Singhal

1

asked Mar 17 at 11:56

Best practices

0 votes

0 replies

53 views

Best way to generate embeddings for structured product attributes in B2B ecommerce search?

I am building a B2B product search system using vector embeddings and would like advice specifically on how to generate embeddings for structured product attributes. Context. Domain: B2B ecommerce ...

Sumit Raj

21

asked Feb 4 at 6:04

0 votes

0 answers

365 views

Problems using local Ollama when embedding into ChromaDB

I am running a Python script on a Linux VM without CUDA support, using a local Ollama instance. When checking Ollama on the console with ollama run jina/jina-embeddings-v2-base-de "Hello world", the ...

Lars Hinrichsen

63

asked Jan 4 at 11:00

0 votes

0 answers

719 views

How to fix EOF Server Ollama Error when embedding

I built a RAG system in Laravel with the Neuron AI framework. I tried to store docs in the vector store but got this error: [2025-12-04 06:49:45] local.ERROR: Server error: `POST http://localhost:11434/api/...

Leslie Joe

335

asked Dec 4, 2025 at 6:59

2 votes

4 answers

134 views

Loading many PyTorch .pt files from Google Drive in Google Colab is extremely slow

I am storing my ESM protein embeddings as individual .pt files in Google Drive (about 46,000 files, total size ~182 MB). When I try to load them in Google Colab, the process is extremely slow (only ~2 ...

Max__08

21

asked Nov 27, 2025 at 8:59

3 votes

2 answers

213 views

Multimodal embedding requires video first, then image - why?

I am working with the OmniEmbed model (https://huggingface.co/Tevatron/OmniEmbed-v0.1), which is built on Qwen2.5 7B. My goal is to get a multimodal embedding for images and videos. I have the following ...

n_arch

76

asked Oct 2, 2025 at 15:07

-1 votes

1 answer

80 views

How to reconstruct sentences from mean-pooled embeddings (embedding inversion) [closed]

I’m working on a research problem where I want to reconstruct or paraphrase sentences starting from synthetic embeddings. The embeddings are global (mean-pooled), not token-level, so they lose ...

melissa mattos

1

asked Sep 30, 2025 at 0:02

1 vote

2 answers

620 views

How can I match the token count used by BGE-M3 embedding model before embedding?

For my particular project, it would be very helpful to know how many tokens the BGE-M3 embedding model would break a string down into before I embed the text. I could embed the string and count the ...

ManBearPigeon

13

asked Sep 2, 2025 at 18:38

0 votes

0 answers

59 views

iText 5.5: add MP4 video

Is it possible to add MP4 videos with iText 5.5 and have them play when shown? What I tried: final PdfPCell cell = new PdfPCell(); final Chunk chunk = new Chunk(); chunk.setAnnotation(PdfAnnotation ...

Mani Afschar Yazdi

43

asked Jun 11, 2025 at 5:43

1 vote

1 answer

277 views

Rapidfuzz giving no matches but Fuzzywuzzy does

I have been developing a matching system which matches the rows of the client and our central database depending on similarity. I have used a hybrid approach where I needed to somehow map the Company, ...

Prabhjit Singh

21

asked Jun 4, 2025 at 6:21

0 votes

0 answers

94 views

Why does adding token and positional embeddings in transformers work?

In transformer models, I've noticed that token embeddings and positional embeddings are added together before being passed into the attention layers: import torch import torch.nn as nn class ...

Yilmaz

51.4k

asked May 26, 2025 at 21:21
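
For the question above on adding token and positional embeddings, a minimal sketch of the summation step may help make the idea concrete. It uses only the Python standard library rather than the torch code from the truncated excerpt, and it assumes fixed sinusoidal positions; the question's own code may use learned positional embeddings instead. The names `sinusoidal_position`, `embed_sequence`, and `token_table` are hypothetical.

```python
import math
import random


def sinusoidal_position(pos, dim):
    """Fixed sinusoidal encoding for one position (sin on even dims, cos on odd dims)."""
    vec = []
    for i in range(dim):
        angle = pos / (10000 ** (2 * (i // 2) / dim))
        vec.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
    return vec


def embed_sequence(token_ids, token_table):
    """Look up each token vector and add the encoding of its position elementwise."""
    dim = len(token_table[0])
    out = []
    for pos, tok in enumerate(token_ids):
        pos_vec = sinusoidal_position(pos, dim)
        out.append([t + p for t, p in zip(token_table[tok], pos_vec)])
    return out


# Hypothetical toy vocabulary: 5 tokens, 8 dimensions, random values (assumption).
rng = random.Random(0)
VOCAB_SIZE, DIM = 5, 8
token_table = [[rng.gauss(0.0, 1.0) for _ in range(DIM)] for _ in range(VOCAB_SIZE)]

# Token 3 appears at positions 0 and 2; the summed vectors differ by position.
inputs = embed_sequence([3, 1, 3], token_table)
```

In this sketch the same token id produces different input vectors at different positions, which is the property the sum is meant to provide; it does not by itself explain why summation works as well as concatenation.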

0 votes

0 answers

166 views

Is there a difference in how to use an Azure OpenAI API key vs. an actual OpenAI key?

I want to use OpenAI to get embeddings for a recommendation system. With my own credentials, everything worked. Now I need to use my company's credentials to get embeddings for the whole dataset, ...

Anna Abrahamyan

1

asked May 15, 2025 at 8:26

0 votes

0 answers

67 views

How Can I Make Vimeo's JavaScript Player API iframe body Fill instead of Contain?

I want to embed an iframe that contains a Vimeo video inside a container div and give that div any ole aspect-ratio and just have my Vimeo video conform to that using something like object-fit: cover (...

taber

3,240

asked May 9, 2025 at 22:36

0 votes

0 answers

178 views

Embed Android.Views.View in .NET MAUI

I'm migrating a project from native Android to .NET MAUI by replacing native forms with embedded MAUI views (following the embedding method described in this MSDN page). However, now I'm facing a ...

Jesoo

104

asked May 6, 2025 at 10:04

0 votes

0 answers

582 views

Qdrant 400 Bad Request Error When Inserting Multi-Vector Embeddings with Larger Batch Sizes via API

Context: I am working on a semantic search application and using Qdrant to store three types of embeddings per document: dense embeddings (from OpenAI), sparse embeddings (from Qdrant/BM25), rerank ...

Himanshu Gupta

1

asked Apr 28, 2025 at 9:15

0 votes

0 answers

77 views

Node2vec embeddings for Fraud Detection?

I tried to improve XGBoost by stacking node2vec embeddings to the original features for fraud detection. In literature and papers this is frequently discussed as a good method to enhance prediction ...

Marlon Brando

191

asked Apr 25, 2025 at 5:52

1 vote

1 answer

218 views

Spring-AI embedding, cannot search result by method 'similaritySearching' [closed]

I'm using Spring AI to build an app; the stack I chose is JDK 21, spring-boot 3.3.11, spring-ai 1.0.0M6. For both the embedding and chat models I chose Ollama, thus the spring-ai-ollama-spring-...

xiong

19

asked Apr 15, 2025 at 12:39

0 votes

1 answer

186 views

How to validate search terms when using embedding to look for objects in images

I have a search on my site that does both traditional full-text search and searches using embeddings. So, for example, when you search 'red balloon' I want both the text and image results. The problem ...

Frank Conry

2,758

asked Apr 5, 2025 at 2:15

-1 votes

1 answer

29 views

How do I embed a C# app in a GTK window for MSYS2 MinGW?

I have looked at some GTK3-based desktops, and used MSYS2 MinGW and possibly some ReactOS or Windows SDK code. I got the taskbar, clock and desktop replicated. See it here: https://gitlab.com/bluedxca93/11-...

user43787

101

asked Apr 2, 2025 at 4:27

1 vote

1 answer

142 views

Node/edge attributed directed graph embedding

What is the format of node features that graph2vec and GL2vec in karateclub require? It does mention that there should be no string features but with or without it I am running into an error with the ...

Arindam Ghosh

133

asked Mar 14, 2025 at 8:52

0 votes

0 answers

93 views

PyTorch: after loading model state_dict, training loss keeps increasing

I'm training a model, and in the first training run, no matter how many epochs (10,20,30…) I trained the model, everything looked great: the accuracy was increasing and the loss was decreasing as well. But when I ...

LOYINuts

9

asked Mar 2, 2025 at 6:52

0 votes

1 answer

77 views

Precision used in ChromaDB Index

I am using the BAAI/bge-large-en-v1.5 model to create embeddings and then store them in a ChromaDB vector store. These embeddings are held in memory and use HNSW indexing. Is there a way I can find out the ...

Jarvis

13

asked Feb 27, 2025 at 18:50

0 votes

1 answer

98 views

Error loading local embedding model with LlamaIndex

I'm trying to load a local embedding model for a RAG system with LlamaIndex using Settings.embed_model="local:./models/multilingual-e5-large/" The path is correct, the embedding model was ...

Carl Brendt

11

asked Feb 20, 2025 at 9:55

-1 votes

2 answers

132 views

Firestore Bandwidth costs for downloading documents with large embedding vectors [closed]

Background: I'm developing an application that uses vector embeddings for similarity search. I'm considering storing these embeddings directly in Firestore documents, but I'm concerned about potential ...

Shajeel Afzal

5,943

asked Feb 19, 2025 at 15:53

0 votes

1 answer

64 views

In Qualtrics, how can I make HTML text embedded with JavaScript look exactly the same as the default text?

I am trying to keep a table visible for multiple questions on multiple Qualtrics pages, one after the other. I followed guidelines here: https://community.qualtrics.com/custom-code-12/pin-instructions-to-top-...

canIchangethis

253

asked Jan 31, 2025 at 16:59

0 votes

1 answer

68 views

GUI to view matplotlib figures with forward and backward buttons

I want to build a GUI to view matplotlib figures with forward and backward buttons. The GUI is similar to an image viewer. To an extent, I could get the results I need. The problem is that when I use ...

pranjal

1

asked Jan 31, 2025 at 10:30

2 votes

3 answers

3k views

What is the right way to generate Ollama embeddings?

In the embedding models documentation, the suggested way to generate embeddings is ollama.embeddings({ model: 'mxbai-embed-large', prompt: 'Llamas are members of the camelid family', }) but ...

punkish

15.7k

asked Jan 17, 2025 at 9:13

0 votes

1 answer

58 views

Issue Embedding Kaltura Video in Sitecore HTML Editor (Inside Rich Text field)

I’m facing an issue embedding a Kaltura video in a Sitecore rich text field. The video works perfectly in a standalone HTML file, but it doesn’t load when added to Sitecore. Here’s the code I’m using: ...

Saurabh Patil

1

asked Jan 11, 2025 at 15:37

0 votes

1 answer

127 views

Vector embedding with the Ontotext similarity plugin

I am new to Ontotext. I understand Ontotext's similarity index: it creates vectors from the indexed data, and we can search over them. Is there a way to create vectors with other embedding models? Please ...

Vanaja Uthayakumar

1

asked Jan 5, 2025 at 1:01

0 votes

1 answer

403 views

Filter on Azure AI Search vector DB based on metadata using Python SDK

I am using the Azure AI Search service to store my document chunks along with some information inside the metadata. However, when I try to filter on metadata, the filter does not work. Here is an example of ...

Jason_mils

97

asked Dec 13, 2024 at 16:34

1 vote

0 answers

62 views

Embedding of sequence of events sets

My data can be represented as a sequence of events; each event is a set of elements. Events appear one-by-one in time; elements appear all together within the event. Example: event_t1: {A, B, C}, ...

Yury Gubman

65

asked Dec 4, 2024 at 14:42

1 vote

1 answer

173 views

Unable to install Python package llama-index-embeddings-huggingface

I am trying to implement the code discussed in the HuggingFace cookbook article at https://huggingface.co/learn/cookbook/en/rag_llamaindex_librarian. I get to the point where I am trying to install ...

Mark Hamilton

11

asked Nov 25, 2024 at 8:32

2 votes

5 answers

204 views

Segmentation fault when executing a Python script in a C program?

I need to execute some Python and C at the same time. I tried using Python.h: #include <Python.h> int python_program(char* cwd) { char* python_file_path; FILE* fd; int run; ...

Yuki

93

asked Nov 20, 2024 at 16:37

0 votes

1 answer

126 views

CNN model for text classification summary showing no output shape and unbuilt params

I am attempting to create a CNN model for text classification based on tweet data with the ELMo embedding. Once the model is built, and before compiling, I print a model summary, but the model summary ...

leakie

1

asked Nov 4, 2024 at 15:57

0 votes

0 answers

862 views

How to import a custom embedding model into ollama

I have tried to convert an HF model to GGUF and then import it into Ollama, but I ran into a problem. When I tried a model like m3e-large (https://huggingface.co/moka-ai/m3e-large), I successfully imported, ...

IronSpiderMan

48

asked Oct 29, 2024 at 7:25

1 vote

1 answer

357 views

How to get the least similar vectors in a vector database

Recently I learned about four-valued logic and the Catuṣkoṭi and I'm interested in applying these ideas to experiments with LLMs and embedding vectors. More specifically, given a large spectrum of ...

sigalor

1,439

asked Oct 23, 2024 at 9:49
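
For the question above on finding the least similar vectors, one general property is worth sketching: under cosine similarity, cos(-q, x) = -cos(q, x), so the vectors least similar to a query are the ones most similar to the negated query. The brute-force sketch below illustrates this with the standard library only. The helper names and the toy data are hypothetical, and whether a particular vector database or approximate index handles negated queries well is not something the excerpt addresses.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def least_similar(query, vectors, k=1):
    """Indices of the k vectors with the lowest cosine similarity to the query."""
    order = sorted(range(len(vectors)), key=lambda i: cosine(query, vectors[i]))
    return order[:k]


def most_similar(query, vectors, k=1):
    """Indices of the k vectors with the highest cosine similarity to the query."""
    order = sorted(
        range(len(vectors)),
        key=lambda i: cosine(query, vectors[i]),
        reverse=True,
    )
    return order[:k]


# Hypothetical toy collection (assumption, not data from the question).
vectors = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.1], [0.7, 0.7]]
query = [1.0, 0.0]
negated_query = [-x for x in query]

# Both calls select the same indices on this toy data.
farthest = least_similar(query, vectors, k=2)
nearest_to_negated = most_similar(negated_query, vectors, k=2)
```

The practical consequence is that an ordinary top-k similarity search with the negated query can stand in for a "bottom-k" search when the metric is cosine similarity or inner product.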

0 votes

1 answer

327 views

Azure Managed Grafana - how to embed a graph in a web app

I am creating a web application to display a graph created in Azure managed Grafana. I am trying to achieve this using the share->Embed option on the graph panel. It has been challenging. I have ...

KaiT

1

asked Oct 16, 2024 at 5:11

0 votes

1 answer

98 views

Python thread pool hanging the application

I am currently running the Python script below: # Function to process MMR search def process_mmr_search(row, itemdesc): try: formatted_itemdesc = str(row[itemdesc]) print('...

Ashwin Baldawa

49

asked Oct 12, 2024 at 11:53

0 votes

2 answers

3k views

How to batch process in AWS Bedrock?

Based on the AWS docs provided here: https://docs.aws.amazon.com/code-library/latest/ug/python_3_bedrock-runtime_code_examples.html. In the following example, a model in Bedrock is invoked to generate ...

kishi

63

asked Oct 11, 2024 at 1:26

0 votes

2 answers

807 views

Latency issue using TextEmbeddingModel

I'm using Vertex AI's TextEmbeddingModel to calculate embeddings, and the first call shows significantly higher latency than the rest, likely due to caching. However, this isn't context-caching, and ...

user1848018

1,116

asked Oct 10, 2024 at 16:50

0 votes

0 answers

184 views

RAG with Haystack: compiles but returns empty responses

My RAG pipeline (using Haystack) compiles and runs, but is returning empty responses. From my checks, I thought it might be due to the embedding and llm models not being compatible, so I changed to an ...

ArieAI

512

asked Oct 7, 2024 at 13:49

0 votes

1 answer

202 views

Issue integrating Cohere with ChromaDB: "ValueError: setting an array element with a sequence"

I'm currently trying to integrate Cohere embeddings into ChromaDB, but I'm facing an issue when adding documents to my Chroma collection. I am using chromadb 0.5.11 and cohere 5.10.0. When I use the ...

Marco Palombo

11

asked Sep 28, 2024 at 18:25

0 votes

1 answer

234 views

Retrieve similarity score in Python for HNSW vector indexing in PostgreSQL

I'm having some trouble returning the score of a vector search on PostgreSQL. I have created an HNSW index on the columns answer_embedding and answer_keyword_embedding. In Python, I've written this ...

MrAssel

1

asked Sep 28, 2024 at 8:03

0 votes

1 answer

350 views

Embedding using langchain_aws returns None values

I am trying to embed a text using the langchain_aws BedrockEmbeddings, but when I invoke the function, I get a list of None values. Here's the code: from langchain_community.llms.bedrock import ...

Md Tausif

521

asked Aug 27, 2024 at 16:05

0 votes

2 answers

157 views

Interoperating with C embedded struct ("inheritance") in C++

I'm working on a C++ program that depends on a C library that uses the "struct embedding" trick to simulate inheritance (of data storage, not behaviour), similar to the Parent and Child structs ...

Alexander

64k

asked Aug 13, 2024 at 1:53

0 votes

1 answer

854 views

Error when using inputs_embeds with generate method

I'm encountering a problem when trying to use inputs_embeds to pass the embedding to my model: ValueError: You passed `inputs_embeds` to `.generate()`, but the model class LlamaForCausalLM doesn't ...

Steven Thorn

1

asked Aug 11, 2024 at 3:07

0 votes

1 answer

472 views

Unable to import SentenceTransformer

I am using Colab and trying to import SentenceTransformer: from sentence_transformers import SentenceTransformer However, I got this error: AttributeError Traceback (most ...

A1iMansour

11

asked Jul 18, 2024 at 22:24

2 votes

0 answers

220 views

Error Instantiating EmbeddingModel with ONNX Model intfloat/multilingual-e5-large

I am encountering an error while trying to instantiate the EmbeddingModel using the ONNX model intfloat/multilingual-e5-large. The error message is as follows: Failed to instantiate [org....

Zakaria Hd

21

asked Jul 17, 2024 at 15:25

2 votes

0 answers

706 views

How to use ONNX Transformers model with Spring AI

I am trying to use ONNX export of model intfloat/multilingual-e5-large: https://huggingface.co/intfloat/multilingual-e5-large/tree/main/onnx I have successfully downloaded this model, using it ...

jirka

61

asked Jul 10, 2024 at 14:55
