I am studying a model that detects facial landmarks in an image. It comes from the paper [Convolutional Experts Constrained Local Model for 3D Facial Landmark Detection][1].
I need to confirm why the 200 × 1 × 1 convolutional layer outputs a 200 × ñ × ñ volume. My guess is that the layer has 200 filters with kernel size 1, which would leave the ñ × ñ spatial size unchanged, but I need to be sure.
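To make my guess concrete, here is a minimal sketch of the shape arithmetic I have in mind, using the standard convolution output-size formula. The helper `conv2d_output_shape`, the value of `n_tilde`, and the input channel count are my own illustrative assumptions, not values taken from the paper.

```python
def conv2d_output_shape(in_shape, out_channels, kernel_size, stride=1, padding=0):
    """Return (channels, height, width) produced by a 2D convolution.

    in_shape is (channels, height, width). Each spatial axis follows
    out = floor((in + 2 * padding - kernel_size) / stride) + 1.
    """
    _, height, width = in_shape
    out_h = (height + 2 * padding - kernel_size) // stride + 1
    out_w = (width + 2 * padding - kernel_size) // stride + 1
    return (out_channels, out_h, out_w)


# Hypothetical values for illustration only (assumptions, not from the paper).
n_tilde = 9          # spatial size of the feature map entering the layer
in_channels = 500    # channel count of the feature map entering the layer
feature_map = (in_channels, n_tilde, n_tilde)

# 200 filters with kernel size 1, stride 1, no padding.
out_shape = conv2d_output_shape(feature_map, out_channels=200, kernel_size=1)
print(out_shape)  # (200, 9, 9)
```

If this reasoning is right, a kernel size of 1 only mixes channels at each pixel, so the number of filters (200) sets the output depth while the ñ × ñ spatial size passes through unchanged.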
Can someone please confirm this or correct me?
Thanks in advance.

  [1]: https://openaccess.thecvf.com/content_ICCV_2017_workshops/papers/w36/Zadeh_Convolutional_Experts_Constrained_ICCV_2017_paper.pdf
  [2]: https://i.sstatic.net/Pi9Nq.png
