1,438 questions
0
votes
0
answers
43
views
Low cosine similarity score when using Langchain FAISS wrapper and CLIP Model
I am currently using Langchain wrapper of FAISS vector store and CLIP model. Since Langchain wrapper of FAISS supports only text embedding models, I have implemented my own embedding class(using ...
Tooling
0
votes
2
replies
60
views
Speaker identification/ IVR detection in Indian telephony-calls along with real-time speaker count
I’m working on a Python pipeline for Indian telephony call recordings (mostly 8kHz mono, noisy, multilingual, and with mixed accents). I need to:
estimate how many speakers are in the call
run speaker ...
Best practices
0
votes
0
replies
53
views
Best way to generate embeddings for structured product attributes in B2B ecommerce search?
I am building a B2B product search system using vector embeddings and would like advice specifically on how to generate embeddings for structured product attributes.
Context
Domain: B2B ecommerce
...
0
votes
0
answers
365
views
Problems using local OLLAMA during embedding to ChromaDB
I am running a Python script on a Linux VM without CUDA support and a local OLLAMA. When checking OLLAMA on the console with
ollama run jina/jina-embeddings-v2-base-de "Hello world"
the ...
0
votes
0
answers
719
views
How to fix EOF Server Ollama Error when embedding
I built the RAG system in Laravel with Neuron AI framework. I tried to store docs to the vectorstore but I got error:
[2025-12-04 06:49:45] local.ERROR: Server error: `POST http://localhost:11434/api/...
2
votes
4
answers
134
views
Loading many PyTorch .pt files from Google Drive in Google Colab is extremely slow
I am storing my ESM protein embeddings as individual .pt files in Google Drive (about 46,000 files, total size ~182 MB).
When I try to load them in Google Colab, the process is extremely slow (only ~2 ...
3
votes
2
answers
213
views
Multimodal embedding requires video first, then image - why?
I am working with OmniEmbed model (https://huggingface.co/Tevatron/OmniEmbed-v0.1), which is built on Qwen2.5 7B. My goal is to get a multimodal embedding for images and videos. I have the following ...
-1
votes
1
answer
80
views
How to reconstruct sentences from mean-pooled embeddings (embedding inversion) [closed]
I’m working on a research problem where I want to reconstruct or paraphrase sentences starting from synthetic embeddings.
The embeddings are global (mean-pooled), not token-level, so they lose ...
1
vote
2
answers
620
views
How can I match the token count used by BGE-M3 embedding model before embedding?
For my particular project, it would be very helpful to know how many tokens the BGE-M3 embedding model would break a string down into before I embed the text. I could embed the string and count the ...
0
votes
0
answers
59
views
itext 5.5 add mp4 video
Is it possible to add mp4 videos to itet 5.5 and let them play upon showing?
What I tried:
final PdfPCell cell = new PdfPCell();
final Chunk chunk = new Chunk();
chunk.setAnnotation(PdfAnnotation
...
1
vote
1
answer
277
views
Rapidfuzz giving no matches but Fuzzywuzzy does
I have been developing a matching system which matches the rows of the client and our central database depending on similarity. I have used a hybrid approach where I needed to somehow map the Company, ...
0
votes
0
answers
94
views
Why does adding token and positional embeddings in transformers work?
In transformer models, I've noticed that token embeddings and positional embeddings are added together before being passed into the attention layers:
import torch
import torch.nn as nn
class ...
0
votes
0
answers
166
views
Is there a difference of how to use OpenAI Azure API KEY vs actual openAI key?
I want to use open AI to get embeddings for a recommendation system.
I used my credentials everything worked. Now I need to use my company's credentials to get embeddings for the whole dataset, ...
0
votes
0
answers
67
views
How Can I Make Vimeo's Javascript Player API iframe body Fill instead of Contain?
I want to embed an iframe that contains a vimeo video inside a container div and give that div any ole aspect-ratio and just have my vimeo video conform to that using something like object-fit: cover (...
0
votes
0
answers
178
views
Embed Android.Views.View in .net maui
I'm migrating a project from android native to .net maui, by replacing native forms with embedded maui views (following the embedding method described in this MSDN page)
However, now I'm facing a ...
0
votes
0
answers
582
views
Qdrant 400 Bad Request Error When Inserting Multi-Vector Embeddings with Larger Batch Sizes via API
Context
I am working on a semantic search application and using Qdrant to store three types of embeddings per document:
Dense embeddings (from OpenAI)
Sparse embeddings (from Qdrant/BM25)
Rerank ...
0
votes
0
answers
77
views
Node2vec embeddings for Fraud Detection?
I tried to improve XGBoost by stacking node2vec embeddings to the original features for fraud detection. In literature and papers this is frequently discussed as a good method to enhance prediction ...
1
vote
1
answer
218
views
Spring-AI embedding, cannot search result by method 'similaritySearching' [closed]
I'm using Spring ai to build an app, the stack I chose is jdk 21, spring-boot 3.3.11 ,spring-ai 1.0.0M6. And for both embedding and chat models, I chose to use Ollama, thus the spring-ai-ollama-spring-...
0
votes
1
answer
186
views
How to validate search terms when using embedding to look for objects in images
I have a search on my site that does both tradition full text search and searches using embeddings. So, for example, when you search 'red balloon' I want both the text and image results. The problem ...
-1
votes
1
answer
29
views
How do i embedd a c# app in a gtk window for msys2 mingw?
i have looked at some gtk3 based desktop, used msys2 mingw and some possibly reactos or windows sdk code.Got the taskbar, clock and desktop replicated.. See it here: https://gitlab.com/bluedxca93/11-...
1
vote
1
answer
142
views
Node/edge attributed directed graph embedding
What is the format of node features that graph2vec and GL2vec in karateclub require? It does mention that there should be no string features but with or without it I am running into an error with the ...
0
votes
0
answers
93
views
pytorch after load model state_dict training loss keep increasing
Im training a model and in the first training, no matter how many epochs (10,20,30…) i trained the model, everything looks great! the acc was increasing and the loss was decreasing as well. but when i ...
0
votes
1
answer
77
views
Precision used in ChromaDB Index
I am using BAAI/bge-large-en-v1.5 model to embed and then store these embeddings in ChromaDB vector-store. These embeddings are in the memory and using HNSW indexing.
Is there a way I can find out the ...
0
votes
1
answer
98
views
Error loading local Emdedding model with Llama index
I'm trying to load a local embedding model for a RAG system using LLama-index using
Settings.embed_model="local:./models/multilingual-e5-large/"
The path is correct, the embedding model was ...
-1
votes
2
answers
132
views
Firestore Bandwidth costs for downloading documents with large embedding vectors [closed]
Background
I'm developing an application that uses vector embeddings for similarity search. I'm considering storing these embeddings directly in Firestore documents, but I'm concerned about potential ...
0
votes
1
answer
64
views
In Qualtrics, what can I do to make my with javasectipt embedded HTML text exactly the same as the default text?
I try to have a table visible for multiple questions on multiple qualtrics pages one after the other. I followed guidelines here: https://community.qualtrics.com/custom-code-12/pin-instructions-to-top-...
0
votes
1
answer
68
views
Gui to view matplotlib figure with forward and backward button
I want to build a GUI to view matplotlib figures with forward and backward buttons.
GUI is just similar to view images.
To an extent, I could get the results I need.
The problem is: that when I use ...
2
votes
3
answers
3k
views
what is the right way to generate ollama embeddings?
In the embedding models documentation, the suggested way to generated embeddings is
ollama.embeddings({
model: 'mxbai-embed-large',
prompt: 'Llamas are members of the camelid family',
})
but ...
0
votes
1
answer
58
views
Issue Embedding Kaltura Video in Sitecore HTML Editor (Inside Rich Text field)
I’m facing an issue embedding a Kaltura video in a Sitecore rich text field. The video works perfectly in a standalone HTML file, but it doesn’t load when added to Sitecore.
Here’s the code I’m using:
...
0
votes
1
answer
127
views
vector embedding on ontotext similarity plugin
Am new to Ontotext. I understand similarity index of ontotext. It creates vectors of the index data and we can do search upon it. Is there a way to create vectors by other embedding models? Please ...
0
votes
1
answer
403
views
Filter on Azure Search AI vector db based on metadata using Python SDK
I am using Azure Search AI service to store my documents chunk along with some information inside the metadata. However when I try to filter on metadata the filter does not work.
Here is an example of ...
1
vote
0
answers
62
views
Embedding of sequence of events sets
My data can be represented as a sequence of events; each event is a set of elements. Events appear one-by-one in time; elements appear all together within the event.
Example: event_t1: {A, B, C}, ...
1
vote
1
answer
173
views
Unable to install Python package llama-index-embeddings-huggingface
I am trying to implement the code discussed in the HuggingFace cookbook article at https://huggingface.co/learn/cookbook/en/rag_llamaindex_librarian.
I get to the point where I am trying to install ...
2
votes
5
answers
204
views
Segmentation fault when executing a Python script in a C program?
I need to execute some python and C at the same time.
I tried using Python.h:
#include <Python.h>
int python_program(char* cwd)
{
char* python_file_path;
FILE* fd;
int run;
...
0
votes
1
answer
126
views
CNN model for text classification summary showing no output shape and unbuilt params
I am attempting to create a CNN model for text classification based on tweet data with the ELMo embedding. Once the model is built before compiling I am issuing a model summary but the model summary ...
0
votes
0
answers
862
views
How to import a custom embedding model into ollama
I have tried to convert hf model to gguf, and then import into ollama. But I got some problem.
When I tried model like m3e-large (https://huggingface.co/moka-ai/m3e-large),I successfully imported, ...
1
vote
1
answer
357
views
How to get the least similar vectors in a vector database
Recently I learned about four-valued logic and the Catuṣkoṭi and I'm interested in applying these ideas to experiments with LLMs and embedding vectors. More specifically, given a large spectrum of ...
0
votes
1
answer
327
views
Azure Managed Grafana - how to embed a graph on an WebApp
I am creating a web application to display a graph created in Azure managed Grafana. I am trying to achieve this using the share->Embed option on the graph panel.
It has been challenging. I have ...
0
votes
1
answer
98
views
Python Thread pool hanging the application
i am currently running the below python script:
# Function to process MMR search
def process_mmr_search(row, itemdesc):
try:
formatted_itemdesc = str(row[itemdesc])
print('...
0
votes
2
answers
3k
views
how to batch process in aws bedrock?
based on aws docs provided here , https://docs.aws.amazon.com/code-library/latest/ug/python_3_bedrock-runtime_code_examples.html. In the following example, a model in bedrock is invoked to generate ...
0
votes
2
answers
807
views
Latency issue using TextEmbeddingModel
I'm using Vertex AI's TextEmbeddingModel to calculate embeddings, and the first call shows significantly higher latency than the rest, likely due to caching. However, this isn't context-caching, and ...
0
votes
0
answers
184
views
RAG with Haystack: compiles but returns empty responses
My RAG pipeline (using Haystack) compiles and runs, but is returning empty responses.
From my checks, I thought it might be due to the embedding and llm models not being compatible, so I changed to an ...
0
votes
1
answer
202
views
Issue integrating Cohere with ChromaDB: "ValueError: setting an array element with a sequence
I'm currently trying to integrate Cohere embeddings into ChromaDB, but I'm facing an issue when adding documents to my Chroma collection. I am using chromadb 0.5.11 and cohere 5.10.0.
When I use the ...
0
votes
1
answer
234
views
Retrieve similarity score in Python for HNSW vector indexing in PostgreSQL
I'm having some trouble returning the score of a vector search on PostgreSQL. I have created a HNSW index on the columns answer_embedding and answer_keyword_embedding.
In Python, I've written this ...
0
votes
1
answer
350
views
Embedding using the LangChain_AWS is giving None value
I am trying to embed a text using the langchain_aws BedrockEmbeddings, but when I invoke the function, I get a list with the None values.
Here's the code:
from langchain_community.llms.bedrock import ...
0
votes
2
answers
157
views
Interporating with C embedded struct ("inheritance") in C++
I'm working on a C++ program depends on a C library that uses the "struct embedding" trick to simulate inheritance (of data storage, not behaviour), similar to the Parent and Child structs ...
0
votes
1
answer
854
views
Error when using inputs_embeds with generate method
I'm encountering a problem when trying to use inputs_embeds to pass the embedding to my model:
ValueError: You passed `inputs_embeds` to `.generate()`, but the model class LlamaForCausalLM doesn't ...
0
votes
1
answer
472
views
Unable to import SentenceTransformer
I am using Colab, I am trying to import SentenceTransformer:
from sentence_transformers import SentenceTransformer
However, I got this error:
ttributeError Traceback (most ...
2
votes
0
answers
220
views
Error Instantiating EmbeddingModel with ONNX Model intfloat/multilingual-e5-large
I am encountering an error while trying to instantiate the EmbeddingModel using the ONNX model intfloat/multilingual-e5-large. The error message is as follows:
Failed to instantiate [org....
2
votes
0
answers
706
views
How to use ONNX Transformers model with Spring AI
I am trying to use ONNX export of model intfloat/multilingual-e5-large:
https://huggingface.co/intfloat/multilingual-e5-large/tree/main/onnx
I have successfully downloaded this model, using it ...