Parallelism Strategy Advisor

Optimize TP, PP, and DP for distributed training