Yes, a multi-cell system (8 or 16 balancers) on one board makes more sense. The microcontroller can sense if there is a voltage present, and if not, just act as a message relay. Also the main controller can go on the board, plus any other stuff needed such as optoisolator, current sensors, LCD display drivers, control buttons, and current cutoff FETs. The LCD display could be eliminated if there is remote console. So by stackable I do not mean individual balancers, but a daisy chain. They would want to be on one board.
Simplicity is paramount. For me there is no difference between a cost of $0.5 or $5 per MC, if the better MC has more storage and is easier to program/debug, better ADC, Rx Tx, etc. I don't have the experience, so I cannot really comment on stackable vs. Leaf BMS. I would consider Leaf BMS to be more complex, and certainly less flexible. For example the stackable can be used for LFP or 4.35V LiPo. And there will be other battery technologies in the future coming down the pipeline.
Of course when I get to the point where I am able to program an ATtiny or ATmega, I would know more about the advantages/disadvantages.