> So if I understand what you're saying, a 1x1 convolutional layer for collapsing 100 channels to 10 channels would take a 100x512x512 tensor and collapse it to a 10x512x512 tensor?
Correct. As I understand it, this would be applying a 1x1 covolution with 10 filters to a 100x512x512 tensor.
Correct. As I understand it, this would be applying a 1x1 covolution with 10 filters to a 100x512x512 tensor.