SYCL™ Projects
There are many SYCL projects being developed and maintained by a cross section of the community. This page contains some of those projects.
Project Name  Main Contributor 
Benchmarks 

A BUDE virtualscreening benchmark, in many programming models 
University of Bristol 
STREAM, for lots of devices written in many programming models 
University of Bristol 
Breadthfirstsearch and PageRank in SYCL benchmarked against Galois and CUDA implementations on NVIDIA® GPU 
University of Texas 
This repository contains a collection of dataparallel programs for evaluating oneAPI direct programming. Each program is written with CUDA, SYCL, and OpenMP4.5 target offloading. 
ANL 
Heat code from OpenMP Target training 
University of Bristol 
The purpose of this benchmark tool is to evaluate performance bounds of GPUs on mixed operational intensity kernels 
Parallel Scientific Computing Lab 
The purpose of this test program is to experiment with various "performance portability" frameworks and libraries. 
CERN 
A miniapp to represent the multipole resonance representation lookup cross section algorithm 
ANL 
The Rodinia Benchmark in SYCL 
ANL 
SYCL Bench
SYCL Benchmark Suite 
University of Salerno, University of Heidelberg, University of Innsbruck 
The Monte Carlo Macroscopic Cross Section Lookup Benchmark 
ANL 
Efficient evaluation of dense potential kernels on CPUs and GPUs via SYCL 
UK Exascale Project, UCL 
Frameworks 

The alpaka library is a headeronly C++14 abstraction library for accelerator development. 
HZDR, LBNL and CASUS 
Software Framework for Block Structured AMR Used by the WarpX project and 7 other projects reference the AMReX project in the documentation 
LBNL 
Athena is a high performance scalable deep learning framework. It represents computation through computation graph and operates tensors 
Intel® 
KernelFaRer: Replacing NativeCode Idioms with HighPerformance Library Calls 
University of Alberta 
Kokkos C++ Performance Portability Programming EcoSystem: The Programming Model  Parallel Execution and Memory Abstraction There are many applications that use Kokkos, some are listed on the Kokkos website and there are >30 confirmed applications using Kokkos. On GitHub there are > 600 projects that reference Kokkos in the documentation 
ORNL, Sandia National Laboratories 
Public development project of the LAMMPS MD software package 
ORNL, Sandia Research Lab 
A library exposing a lightweight CAPI for SYCL to use in Python applications including Numba 
Intel® 
A Generalpurpose Parallel and Heterogeneous Task Programming System 
University of Utah 
The Toolkit for Adaptive Stochastic Modeling and NonIntrusive ApproximatioN 
ORNL 
TensorFlow is an endtoend open source platform for machine learning. Over 99,000 projects on GitHub reference TensorFlow. 
Google® 
An applicationfocused API for memory management on NUMA & GPU architectures 
LLNL 
Libraries 

Antares is an automatic engine for multiplatform kernel generation and optimization 
Microsoft® 
ArborX is an opensource library designed to provide performance portable algorithms for geometric search, similarly to nanoflann and Boost Geometry. 
ORNL 
Performanceportable library for particlebased simulations 
Argonne 
C/C++ library wrapping cuBLAS, rocBLAS, and DPC++ oneAPI MKL blas (USM version). Implementation is C++, the interface provided is C and designed for easy calling from Fortran code. 
University of Chicago, ANL 

CERN 
Simple code to assemble the Poisson equation on manycore architectures using Dolfinx and SYCL. 
Universtiy of Cambridge, UK Excalibur Project 
Eigen is a C++ template library for linear algebra: matrices, vectors, numerical solvers, and related algorithms. 

FTK is a library that scales, simplifies, and delivers feature tracking algorithms for scientific datasets. You may use FTK as ParaView plugins, Python bindings, or command line interface. 
ANL, Ohio State University 
Ginkgo is a highperformance linear algebra library for manycore systems, with a focus on sparse solution of linear systems 
French national research institute for digital science and technology 
OpenGL Mathematics (GLM) is a header only C++ mathematics library for graphics software based on the OpenGL Shading Language (GLSL) specifications. 

This library provides data parallel C++ container classes with internal memory layout that is transformed to map efficiently to SIMD architectures 
University of Edinburgh, EPCC 
Gtensor is a multidimensional array C++14 headeronly library for hybrid GPU development 
University of Chicago, ANL 
oneAPI Data Analytics Library 
Intel® 
oneAPI Deep Neural Network Library 
Intel® 
Dataparallel molecular dynamics simulator for Intel oneAPI 
Individual 
oneAPI Math Kernel Library Interfaces 
Intel® 
OP2 is an API with associated libraries and preprocessors to generate parallel executables for applications on unstructured grids 
University of Warwick 
Parallelization and acceleration abstractions for code and performance portability 
UCL, DiRAC, Institute of Astronomy from KU Leuven 
Partitioned Global Address Space This directory contains the Global Arrays (GA), Communications Runtime for Exascale (ComEx) runtime library, Aggregate Remote Memory Copy Interface (ARMCI) runtime library, Memory Allocator (MA), parallel I/O libraries (DRA,EAF,SF), TCGMSG, and TCGMSGMPI packages bundled together 
Pacific Northwest National Laboratory 
PETSc, pronounced PETsee (the S is silent), is a suite of data structures and routines for the scalable (parallel) solution of scientific applications modeled by partial differential equations. 
ANL 
PICSAR is a high performance repository intended to help scientists porting their ParticleInCell (PIC) codes to the next generation of exascale computers. 
LBNL, ANL, CERN 
The objective of the Software for Linear Algebra Targeting Exascale (SLATE) project is to provide fundamental dense linear algebra capabilities to the US Department of Energy and to the highperformance computing (HPC) community at large 
DoE 
Generic Stencil Simulation Library that can execute arbitrary Moorelike stencil kernels on arbitrary cell types in arbitrary buffers, using FPGAs. 
Paderborn Center for Parallel Computing 
An implementation of BLAS using the SYCL open standard 
Codeplay™ 
SYCLDNN is a library implementing neural network algorithms written using SYCL 
Codeplay 
SYCLML is a C++ library, implementing classical machine learning algorithms using SYCL 
Codeplay 
Vectorised data model base and helper classes. 
CERN 
VectorizationParallelismandGPGPUprogramming This repository aims to accelerate various algortihms using vectorization, parallelism and GPGPU programming. Any sort of recommendations, improvements and even contributions are welcome. 

A machine vision library written in SYCL and C++ that shows performanceportable implementation of graph algorithms 
Codeplay 
A simple C++ framework for performance portability and Fortran code porting 
ORNL 
Scientific 

Experimentindependent toolkit for (charged) particle track reconstruction in (high energy) physics experiments implemented in modern C++ 
CERN 
AMRWind is a massively parallel, blockstructured adaptivemesh, incompressible flow sover for wind turbine and wind farm simulations. 
NREL, LBNL 
The ATLAS Experiment's main offline software repository 
CERN 
They implement 2nd order epistasis detection, with singleobjective optimization, using the K2 score objective function. 
University Lisbon 
Highorder exhaustive epistasis detection using K2 Bayesian scoring 
Inesc id Lisbon 
Simulates sparse symmetric CSB COO format times dense block vector, a key operation for the LOBPCG solver in the MFDn configuration interaction code. 
NERSC, LBNL 
DICOMautomaton is a multipurpose tool for analyzing medical physics data with a focus on automation 
University of British Columbia 
Distributed knearest Neighbors using Locality Sensitive Hashing and SYCL 
Scientific Computing (SC) and Simulation of Large Systems (SGS) @ University of Stuttgart 
NaluWind is a generalized, unstructured, massively parallel, incompressible flow solver for wind turbine and wind farm simulations 
National Renewable Energy Laboratory (NREL) 
GROMACS 
Stockholm University 
A 2D Hydro code for benchmarking purpose 

This repository contains code developed in the context of porting the MadGraph5_aMC@NLO event generator software onto GPU hardware 
CERN 
A mini app simulating weatherlike flows for training in parallelizing accelerated HPC architectures. This uses the YAKL library which is developing SYCL support 
ORNL 
An adaptive mesh, Nbody hydro cosmological simulation code 
LBNL, NREL, Argonne National Labs 

Universidade Federal de Sao Carlos 
Implementing scalable, heterogeneousparallelprocessing algorithms that can be used quickly and accurate decode digital data stored in synthetic DNA generated by project OligoArchive 
EURECOM 
Various benchmarks used to inform PSyclone optimisations 
STFC 
TAU is a program and performance analysis tool framework 
DOE, ECP, and ASC program at University of Oregon 
This is an implementation of different bioinformatics algorithms for protein analysis, based on the Viterbi algorithm 
JetBrains 