
[CPU-sampling] Avoid access/init cuda instances at each sampling (child) process #6561

@chang-l

Description


In the PyTorch dataloader (CPU sampling), worker processes never initialize a CUDA context, since the CUDA runtime does not support the fork start method (https://pytorch.org/docs/stable/notes/multiprocessing.html#cuda-in-multiprocessing). I think the DGL dataloader should follow this convention as well, if possible.

However, in DGL, creating a COO/CSR matrix calls the cudaPointerGetAttributes CUDA runtime API inside the IsPinned function. As a result, every worker process that constructs such a matrix ends up initializing/accessing CUDA:

```cpp
is_pinned = (aten::IsNullArray(row) || row.IsPinned()) &&
            (aten::IsNullArray(col) || col.IsPinned()) &&
            (aten::IsNullArray(data) || data.IsPinned());
```
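A minimal, CUDA-free sketch of the problem: the names below (ProbePinned, EagerCSR, g_runtime_calls) are invented for illustration and are not DGL's actual API. The counter stands in for calls into the CUDA runtime, the way cudaPointerGetAttributes is reached from IsPinned():

```cpp
#include <cassert>

// Invented stand-in: counts how often the "CUDA runtime" is probed, the way
// cudaPointerGetAttributes would be from inside IsPinned().
static int g_runtime_calls = 0;

bool ProbePinned() {
  ++g_runtime_calls;  // real code would touch the CUDA runtime here
  return false;       // pretend the buffer is ordinary pageable memory
}

// Eager variant mirroring the current constructor: merely building a matrix
// queries the runtime, so a forked CPU-only worker still initializes CUDA.
struct EagerCSR {
  bool is_pinned;
  EagerCSR() { is_pinned = ProbePinned(); }  // runtime touched at construction
};
```

Constructing an EagerCSR is enough to bump the counter, which is exactly the unwanted side effect in a forked worker.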

This is not a bug and will not error out, because the behavior is guarded by clearing the CUDA error message (see below):

```cpp
// We don't want to fail in these particular cases since this function
// can be called when users only want to run on CPU even if CUDA API is
// enabled, or in a forked subprocess where CUDA context cannot be
// initialized. So we just mark the CUDA context to unavailable and
// return.
is_available_ = false;
cudaGetLastError();  // clear error
```

Nevertheless, I believe it would still be preferable to adhere to PyTorch's convention by removing the IsPinned call from the constructor of the COO/CSR matrix.
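One way the fix could look, sketched with the same invented names as above (ProbePinned, g_runtime_calls; not DGL's real API): defer the pinned check to first use so that construction stays runtime-free and CPU-only workers never touch CUDA.

```cpp
#include <cassert>

// Invented stand-in for calls into the CUDA runtime (cudaPointerGetAttributes).
static int g_runtime_calls = 0;

bool ProbePinned() {
  ++g_runtime_calls;
  return false;
}

// Lazy variant: the constructor does nothing device-related; the pinned
// status is computed at most once, on the first IsPinned() call.
struct LazyCSR {
  mutable bool checked = false;
  mutable bool pinned = false;
  bool IsPinned() const {
    if (!checked) {          // probe the runtime only on first use
      pinned = ProbePinned();
      checked = true;
    }
    return pinned;
  }
};
```

With this shape, a forked sampling worker that only builds and slices matrices never triggers the probe, matching the PyTorch dataloader convention.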

I can put up a PR with the fix later. cc @nv-dlasalle @yaox12 @frozenbugs @TristonC
