Serving Engine Advisor

Optimize inference for latency and throughput