Beantwortet
How to use Sum and Dot function in GPU computation with arrayfun?
You cannot perform vector operations in GPU arrayfun. The a and b arguments to your function are not the whole array, they are t...

mehr als ein Jahr vor | 0

| akzeptiert

Beantwortet
Double precision in deep learning
This is possible with a |dlnetwork| but it is (currently) more of a workaround than anything else. Once your dlnetwork is ready ...

mehr als ein Jahr vor | 0

| akzeptiert

Beantwortet
When I want to train Fully convolutional neural network, I have the following error
I suspect your best bet here is to upgrade MATLAB to a more recent version. Many of these bugs will be fixed with newer versions...

mehr als ein Jahr vor | 0

Beantwortet
Error saying ''Dot Indexing is not supported for variables of this type".
This could be a bug, especially if you didn't modify the example code. What is the data you passed to |adamupdate|?

mehr als ein Jahr vor | 0

Beantwortet
YoloV4 - Out of memory
Generally the best solution here is to reduce the size of the input data. Still, these object detector networks do seem to be...

mehr als ein Jahr vor | 1

| akzeptiert

Beantwortet
Why is my GPU code faster with the profiler on in RTX GPUs?
This is due to an optimization which is not performing ideally under memory pressure. If you reduce the size of your input you'l...

mehr als ein Jahr vor | 0

| akzeptiert

Beantwortet
Conflicting behaviour of arrayfun() with gpu: example that works and example of error
The function normcdf isn't supported by GPU arrayfun because it accepts varargin. For a list of supported functions see the docu...

mehr als ein Jahr vor | 0

| akzeptiert

Beantwortet
How to initialize a string variable, and pass it to the matlab function using GPU coder
MATLAB and Simulink code generation do not currently support string. Edit: Sorry, my bad, it does support scalar strings, but n...

mehr als ein Jahr vor | 0

| akzeptiert

Beantwortet
need to plot the accuracy vs epoch graph
Add Plots="training-progress" to your training options. FWIW, you shouldn't use ReadFcn for resizing images, it dramatically sl...

mehr als ein Jahr vor | 1

Beantwortet
Update BatchNorm Layer State in Siamese netwrok with custom loop for triplet and contrastive loss
Interesting question! The purpose of batch norm state is to collect statistics about typical inputs. In a normal Siamese workflo...

mehr als ein Jahr vor | 0

| akzeptiert

Beantwortet
gpu arrayfun don't support linspace or NaN array
You cannot create an array inside a call to GPU arrayfun, only scalars.

mehr als ein Jahr vor | 0

Beantwortet
GPU Support for RTX 4090
Forgive me for needing to correct Walter, but the last three versions of MATLAB _will_ natively support the 4000 series because,...

mehr als ein Jahr vor | 2

Beantwortet
mexcuda gives unsupported GNU version error
R2022a uses CUDA 11.2, not 11.7. I suspect that the actual compiler that ends up being used is the version of nvcc shipped with ...

mehr als ein Jahr vor | 0

| akzeptiert

Beantwortet
GPU speed up for pcg() is disappointing
I'm guessing LL' is extremely dense, which will explain why the solver stalls. On the GPU the preconditioning is (currently) per...

mehr als ein Jahr vor | 0

| akzeptiert

Beantwortet
How to implement Siamese network with the two subnetworks not share weights
You can try gathering the weights back from each network after you've used it, as in net = dlupdate(@gather,net). This should sa...

mehr als ein Jahr vor | 0

Beantwortet
Speed up inference or/and training of a 3D deep neural network (U-net) for a regression task
Have you tried using dlaccelerate? As well as ensuring any Custom Layers are using the Acceleratable mixin?

mehr als ein Jahr vor | 1

| akzeptiert

Beantwortet
Matrix multiplication optimization using GPU parallel computation
The Windows Task Manager lets you track GPU utilization and memory graphically, and the utility nvidia-smi lets you do it in a t...

mehr als ein Jahr vor | 1

Beantwortet
How to increase MiniBatchSize
It depends on what you're doing. Some ideas: * Get a new GPU with more memory * Use a smaller model * If your model accepts...

mehr als ein Jahr vor | 0

Beantwortet
Matlab trainNetwork CNN training pauses iterating intermittently at random then continues
Is the pause associated with a validation measurement being added to the training plot? With 7 times as much validation data it ...

mehr als ein Jahr vor | 0

Beantwortet
problems with @arrayfun on GPU
This is a bug. I have reported it. Thanks for finding it! In the meantime, you can work around the issue by using a local funct...

mehr als ein Jahr vor | 0

| akzeptiert

Beantwortet
A problem when using "multi-gpu" as "ExecutionEnvironment" for training a CNN
Most likely this is this issue, which is fixed in the latest update to R2022a. You can also try downgrading your GPU drivers.

mehr als ein Jahr vor | 0

| akzeptiert

Beantwortet
Perform mldivide between 3x3 matrix M and every RGB pixel in a image in GPU
I feel like I'm missing something - this is just a single backslash with multiple right-hand sides, or to avoid permutation a si...

fast 2 Jahre vor | 1

Beantwortet
Library not loaded: @rpath/libcudart.10.2.dylib
This problem should now be fixed at Apple, please reboot and report here if you are still experiencing issues.

fast 2 Jahre vor | 0

Beantwortet
Warning: GPU is low on memory
A 3-D U-net is a very large model. Try reducing |patchSize|, |patchPerImage|, |miniBatchSize| and |inputSize|.

fast 2 Jahre vor | 0

| akzeptiert

Beantwortet
How to run lane detection optimized with GPU coder project on matlab
https://www.mathworks.com/help/gpucoder/ug/lane-detection-optimized-with-gpu-coder.html

fast 2 Jahre vor | 0

Beantwortet
Dedicated GPU Memory Usage - Permanently increases every time code is run
This error means you ran out of GPU memory. I can't reproduce any sort of memory leak in R2022a. It's possible that you are perm...

fast 2 Jahre vor | 1

Beantwortet
minibatchqueue function cannot generate the expected MiniBatchSize
You've asked your arrayDatastore to iterate over the rows because that's the default. So as far as arrayDatastore is concerned, ...

fast 2 Jahre vor | 1

| akzeptiert

Beantwortet
RTX 3090 vs A100 in deep learning.
According to the spec as documented on Wikipedia, the RTX 3090 has about 2x the maximum speed at single precision than the A100,...

fast 2 Jahre vor | 0

| akzeptiert

Beantwortet
GPUCoder does not generate parallelized code
This looks about right to me, because your kernel is too simple and you're transferring data from and to the CPU on every call. ...

fast 2 Jahre vor | 1

Mehr laden