Measuring CPU cache line size using Python code
Published:
Estimating with a small experiment, the cache line size of a cpu (Apple M1 in this case) using Python code.
Published:
Estimating with a small experiment, the cache line size of a cpu (Apple M1 in this case) using Python code.
Published:
Understanding with a small experiment, the feature of pytorch of (almost) always having computations in GPU in an asynchronous fashion.
Published:
Notes about some possible in-place operation cases with tensors in pytorch.
Published:
Useful post about how one can think about matrix-vector multiplication in two different ways, which can be useful in different contexts.
Published:
Guide to thinking about what happens when one calls tensor.backward() in pytorch.