I am a PhD researcher at the University of Turku, working on efficient execution of AI workloads on heterogeneous computing platforms. My research focuses on runtime systems and optimization techniques for multi-DNN pipelines and transformer inference on edge devices.
My work studies how modern AI models can be scheduled and optimized across CPU, GPU and NPU systems while minimizing latency, improving throughput and reducing energy consumption.
My earlier work spans neural signal processing, low-power IC design, and approximate computing for wearable biomedical devices.