I specialize in numerical methods and performance optimization for scientific computing, with a growing focus on GPU kernel design and solver acceleration. Drawing on my PhD and experience with PETSc and Nek5000, I work at the intersection of algorithm design, memory hierarchy tuning, and scalable HPC software. My goal is to devise mathematically sound solutions that clearly navigate the trade-offs between accuracy and performance.

Education
  • 2003 - University of Bucharest (Romania) - BSc in Mathematics
  • 2007 - Royal Institute of Technology (Sweden) - MSc in Numerical Analysis
  • 2012 - Royal Institute of Technology (Sweden) - PhD in Numerical Analysis
Professional

I have been affiliated with the following software projects:

  • High-order Computational Fluid Dynamics software - Nek5000/NekRS (Fortran 77 programming language)
  • Linear Algebra library - PETSc (C programming language)
  • Parallel finite element framework - MOOSE (C++ programming language)
Scientific community service work

Areas of research I have been active in, and selected publications:

  • GPU-focused research and development
    • CUDA kernel development for matrix reordering and memory access optimization
    • GEMM strategies: tuning and performance modeling
    • LLVM and IR transformations for kernel performance
    • Lower-precision control and rounding strategies to accelerate GPU kernels while maintaining numerical robustness
    • The PetscSF Scalable Communication Layer - enabling asynchronous CUDA/NVSHMEM data movement on Summit’s V100 GPUs
  • High Performance Computing
  • ML/AI and Data Science
  • Numerical Analysis - Method development
  • PDE-constrained Optimization
  • Computational Fluid Dynamics - Analysis, Software and Simulations
Emails
  • Work: oanam@peracompute.org
  • Personal: oana_marin@outlook.com
Online presence
  • Previous webpage, not currently maintained