Skip to main content
Filter by
Sorted by
Tagged with
Advice
0 votes
1 replies
69 views

I am looking to find a book on PyTorch that is suitable for beginners, Ive used sklearn in the past for ML its a simple workflow for me prepare the X and Y data, fit/train a model, and make ...
Kev's user avatar
  • 1
0 votes
0 answers
15 views

I have tried loading PyTorch in iPython but get a DLL initialization error. In a normal python console it works fine, as below. Windows 10, Miniconda installation. Can anyone advise how I need to ...
PGajjar's user avatar
0 votes
1 answer
54 views

I have to train a convolutional neural network on a dataset. The NN itself works and does what it's supposed to but now I want to make a confusion matrix and export it into a json file for further ...
traq's user avatar
  • 1
0 votes
0 answers
46 views

I'm currently training a GRU model on American Sign Language (ASL) using a Kaggle dataset, while tweaking the parameters I achieved a peak accuracy of 44.7% on training and 28.2% on testing, that is ...
Bakr MARHFOUL's user avatar
Advice
1 vote
1 replies
34 views

If I'm using any transformer model loaded from the Hugging Face Hub with Python, is it somehow possible to reproduce all the seeds, that have been used for the model training/fine-tuning? Seeds/...
jhn's user avatar
  • 5
Advice
0 votes
1 replies
31 views

Dependency layer: The layers whose outputs are passed to the current layer. Basically, The current layer is dependent on the outputs of the dependency layers. For a project, I need to know if the ...
Mentor_sensei's user avatar
0 votes
1 answer
48 views

PyTorch supports Intel GPU through torch.xpu, but PyTorch Lightning does not currently have built-in XPU accelerator support. Because NeuralForecast uses Lightning under the hood, that also blocks ...
Marek Ozana's user avatar
Best practices
1 vote
0 replies
26 views

I’ve been working on multiple machine learning projects using PyTorch, and I keep running into the same issue: a lot of code ends up being repeated across projects. This includes things like: ...
AIMLSE support's user avatar
0 votes
0 answers
35 views

I found torch-text is archived, and I still want to use it ,because there is a course that uses it. but Since it was archived, I always meet signture missing problem, so I want to fork and fix it for ...
SteinGate's user avatar
0 votes
0 answers
58 views

I'm using XTTS v2 fine-tuned for Vietnamese (vnTTS). Problem: - Running inference on a single sentence → perfect output - Running inference on multiple sentences in a loop → weird sounds/...
Duy Đỗ Đình's user avatar
-1 votes
0 answers
98 views

I have an .h5 file that contains only model weights, not the model architecture. I want to use these weights in a PyTorch model and convert them into a .pth file. Some context: The .h5 file does not ...
Saim Mahmood's user avatar
3 votes
0 answers
36 views

I'm doing some research about the information encoding with LLMs and need to find a way to quantize the weights of the MLP layers(MoE) to 4 bits and even customized mixed precision. Consider from ...
ShoutOutAndCalculate's user avatar
-2 votes
0 answers
133 views

I'm trying to run model on GPU: clf2 = PunctCapSegModelONNX.from_pretrained( "1-800-BAD-CODE/xlm-roberta_punctuation_fullstop_truecase", ort_provider=["CUDAExecutionProvider&...
xæliudzyh's user avatar
Advice
0 votes
4 replies
86 views

Need to extract data (Peptide_Sequence) from XLSX output file to a FASTA file. I'm using pandas. A FASTA file is a standard plain-text format in bioinformatics, used to store nucleotide or amino acid ...
MomoLtx's user avatar
0 votes
1 answer
60 views

I want to setup to use only CPU (saves e.g. space) and for a package it's described in e.g. How to install torch without nvidia? pip install p --index-url https://download.pytorch.org/whl/cpu. Now I ...
Martian2020's user avatar
3 votes
1 answer
82 views

I am building an autonomous AI Agent (managing training workflows) that automatically generates PyTorch/OpenMMLab training scripts and executes them in a background subprocess. One of the common ...
user32496676's user avatar
Best practices
0 votes
2 replies
50 views

I use google colab to train my model I have trained the model on fp32 and use random grid to search the hyperparameter. the training phase is slow; it takes around 3.24it/s. I want to ask can I use ...
Suebpong Pruttipattanapong's user avatar
Best practices
5 votes
5 replies
132 views

I am currently developing an AI-driven fashion recommendation system, specifically focusing on a "Multi-modal Context-Aware Decision Model." A critical component of my recommendation engine ...
王振宇's user avatar
Advice
0 votes
1 replies
58 views

I am experimenting with the V-JEPA model developed by Meta for video understanding. My goal is to analyze a live video stream of people attending a seminar and determine their engagement level (for ...
Harshitha Gangu's user avatar
0 votes
0 answers
65 views

I use this code to load sentence transformer in a GCP VM instance (no GPU). This is a dask plugin used on dask worker.: class NLPSetup(WorkerPlugin): def __init__(self, bucket_uri): self....
cuneyttyler's user avatar
  • 1,414
Advice
2 votes
0 replies
141 views

I am working on a person recognition system for learning purposes. My goal is: Maintain a small gallery of known people (multiple images per person) Given a new query image, return the most similar ...
Shanthini M's user avatar
3 votes
0 answers
98 views

I wrote a PPO-based reinforcement learning code for the Gymnasium CarRacing-v3 environment. (The code was generated with the help of Gemini) However, even after 200,000 frames, the training does not ...
Rai Madu's user avatar
0 votes
0 answers
58 views

I encountered a problem with metrics fading after the first training epoch. During the first epoch, the model training proceeds normally. The loss metrics decrease, and the accuracy increases. The ...
CumMunist's user avatar
4 votes
2 answers
153 views

I am using pytorch lightning to train my model, here I use the lightning callback ModelCheckpoint, with the following settings: ModelCheckpoint( dirpath="path/to/dir", monitor="...
JacobM's user avatar
  • 41
5 votes
0 answers
164 views

I am using an RTX 3060 (12GB VRAM) and implementing a RAG pipeline with the BGE-M3 embedding model. Initially, I installed PyTorch with the CUDA 12.8 wheel (my NVIDIA driver supports CUDA 12.9). ...
Sujith A's user avatar
2 votes
1 answer
67 views

I tried making a logistic regression model using nn.Module class LogisticRegressionModel(nn.Module): def __init__(self, input_dim= None) -> None: super().__init__() if input_dim ...
Bakr MARHFOUL's user avatar
Advice
1 vote
3 replies
90 views

Running help(torch.Tensor.mul) gives: Help on method_descriptor: mul(...) unbound torch._C.TensorBase method mul(value) -> Tensor See :func:`torch.mul`. What does unbound mean in this ...
escapecharacter's user avatar
3 votes
2 answers
124 views

on Python 3.13, torch 2.10.0+cu130 import torch loss = nn.CrossEntropyLoss() loss(torch.tensor((.1, .2)), torch.tensor((.3, .4))) returns - tensor(0.4811) but why does nn.CrossEntropyLoss(torch....
user2309803's user avatar
1 vote
0 answers
70 views

I am training a small PyTorch model and want to use nn.BatchNorm1d. When the batch size is 1 and the model is in training mode, I get the error below; ValueError: Expected more than 1 value per ...
Linda's user avatar
  • 37
0 votes
0 answers
45 views

I trained a CycleGAN model on Google Colab using this repository - https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix The model should enhance dark images. I tested the model on my test dataset ...
rupertpurple's user avatar
1 vote
1 answer
69 views

I am currently working on a project where I have to use GluonTS (the DeepAREstimator and DLinearEstimator). At the beginning it worked well. But now, even when I use the example code from the GluonTS ...
peter mafai's user avatar
2 votes
2 answers
60 views

If I run a py module with only these imports (no additional code) it works fine and the output is Process finished with exit code 0: import torch.utils.data from torch.utils.data.dataloader import ...
Takewood's user avatar
0 votes
0 answers
32 views

I got a pretrained resnet 18 model from this lane detection repo in order to use it as an ADAS(advanced driver assistance systems) function for an electric car making competition. My current goal is ...
Ekim's user avatar
  • 3
0 votes
0 answers
37 views

I am implementing a Reinforcement Learning environment using torchrl where the agent uses an LSTM-based policy. My goal is to train the agent on sequences sampled from a replay buffer. While I have ...
wittn's user avatar
  • 318
0 votes
0 answers
46 views

I'm trying to train a character-level GRU on Linux kernel source but the training loop keeps crashing with this error: RuntimeError: view size is not compatible with input tensor's size and stride (...
user avatar
1 vote
0 answers
72 views

I am utilizing PyTorch for federated experiments. As my experiments involves 50 datasets with models, so, I have to run multiple ML models experiments parallelly. The code for training ML model is ...
coderoid's user avatar
  • 2,400
0 votes
0 answers
40 views

I have a Dataset that is based on IterableDataSet, looking like that class MyDataSet(torch.utils.data.IterableDataset): def __init__(self): # doing init stuff here def __iter__(self): ...
RaJa's user avatar
  • 1,597
Advice
2 votes
1 replies
55 views

# Example of target with class indices loss = nn.CrossEntropyLoss() input = torch.randn(3, 5, requires_grad=True) <=============== WHY ? target = torch.empty(3, dtype=torch.long).random_(5) output =...
VISHMA PRATIM DAS's user avatar
6 votes
0 answers
136 views

About I am trying to create a Docker image with the same Dockerfile with Python 3.10, CUDA 12.8, and PyTorch 2.8 that is portable between two machines: Local Machine: NVIDIA RTX 5070 (Blackwell ...
requiemman's user avatar
0 votes
0 answers
244 views

I am currently on a project to segment 3D-LSM images using self-supervised model and i have been trying to perform a dryrun(testing pre-training) on the AMD GPU droplet on digitalocean. the configs of ...
Manav Patel's user avatar
1 vote
1 answer
57 views

I'm learning GPU programming with PyTorch and I'm confused about when torch.cuda.synchronize() is actually necessary. I have this code that compares CPU and GPU matrix multiplication: import torch ...
nz_21's user avatar
  • 7,841
1 vote
1 answer
101 views

Let S be a nxn symmetric matrix and v a n 1-dimensional vector. We need to compute inside a pytorch loss function the vector (S x v) in an efficient manner. Do you know if there is a way to keep ...
Filippo Portera's user avatar
0 votes
0 answers
45 views

I am training an Encoder-Decoder network to reconstruct brain CT images. Due to OOM (Out of Memory) errors with full-sized images, I implemented a sliding window approach for training and inference. ...
kKodorna's user avatar
2 votes
1 answer
76 views

I'm very new to PyTorch and am attempting to create a dataset for which a given sample has both unmasked and masked data associated with it, or in other words, the first piece of data is just the ...
user1799323's user avatar
3 votes
1 answer
68 views

class SoftmaxRegission(torch.nn.Module): linear: torch.nn.Linear def __init__(self, num_features: int, num_classes: int): super(SoftmaxRegission, self).__init__() self.linear =...
SteinGate's user avatar
0 votes
0 answers
46 views

I am comparing many deep learning models to each other, including UNETR, on the BTCV dataset and noticed a discrepancy in the reported number of parameters. In their paper titled "UNETR: ...
Ahmed's user avatar
  • 163
0 votes
0 answers
71 views

I need fast compute but the resulting sums need higher precision for downstream tasks just for one specific op out of many in the model. torch.bmm has an out_dtype parameter but the documentation does ...
LOST's user avatar
  • 3,384
0 votes
2 answers
137 views

I’m learning PyTorch and I see two common classes: torch.Tensor and torch.nn.Module. I’m a bit confused about their differences and when to use each. Here’s what I understand so far: torch.Tensor ...
Kokala Sai Teja's user avatar
0 votes
0 answers
91 views

Everything I describe here works perfectly fine on my computer that has Windows 11 (Version 10.0.26200). However, on a computer that has Windows 10 (10.0.19045), it does not work. This is a client's ...
FluidMechanics Potential Flows's user avatar
0 votes
0 answers
57 views

I want to use the mobile_sam.pt model in web browser, so I need the onnx format of that. I tried these method, but always getting the same error. segment-anything - samexporter The error above ...
Erfan Riahi's user avatar

1
2 3 4 5
478