Accurately measuring application startup time is a challenge in large-scale distributed systems. This talk presents an automated methodology to measure and optimize startup latency while reducing variability caused by system background processes, CPU performance scaling, and workload contention.
Tree ensemble methods (Random Forest, Gradient Boosting) are widely used in ML but can be inefficient in cloud-based, multi-threaded environments due to uneven workload distribution across heterogeneous CPU cores. This talk analyzes performance trade-offs in existing ONNX-based implementations, introduces a custom C++ wrapper for optimized task scheduling, and demonstrates a 4x speedup in...
Learn for free, join the best tech learning community for a price of a pumpkin latte.
Event notifications, weekly newsletter
Delayed access to all content
Immediate access to Keynotes & Panels
Access to Circle community platform
Immediate access to all content
Courses, quizes & certificates
Community chats