Scikit-learn on GPUs with Array API

Thomas J. Fan
@thomasjpfan github.com/thomasjpfan/pydata-nyc-2023-scikit-learn-array-api

GPU support in scikit-learn ⁉️

Historical Stance 📖

https://scikit-learn.org/stable/faq.html#will-you-add-gpu-support

https://skorch.readthedocs.io/en/stable

scikit-learn v1.2 Array API support (2022)

scikit-learn v1.3 Array API support (2023)

1. scikit-learn API 🖥️

2. Array API Standard 🔬

3. Challenges 🚧

scikit-learn API 🖥️

from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
lda_np = LinearDiscriminantAnalysis()
lda_np.fit(X_np, y_np)
y_pred_np = lda_np.predict(X_np)
type(y_pred_np)
# <class 'numpy.ndarray'>

Enabling Array API support in scikit-learn

Global configuration 🌎

import sklearn
import torch
sklearn.set_config(array_api_dispatch=True)
X_torch_cpu, y_torch_cpu = torch.asarray(X_np), torch.asarray(y_np)
lda = LinearDiscriminantAnalysis()
lda.fit(X_torch_cpu, y_torch_cpu)
type(lda.predict(X_torch_cpu))
# <class 'torch.Tensor'>

Enabling Array API support in scikit-learn

Context Manager 🎬

import sklearn
with sklearn.config_context(array_api_dispatch=True):
    X_torch_cuda = torch.asarray(X_np, device="cuda")
    y_torch_cuda = torch.asarray(y_np, device="cuda")
    lda = LinearDiscriminantAnalysis()
    lda.fit(X_torch_cuda, y_torch_cuda)
    type(lda.predict(X_torch_cuda))
    # <class 'torch.Tensor'>

Performance 🚀

16-core AMD 5950x CPU and Nvidia RTX 3090 GPU

Incremental Support (v1.3)

https://scikit-learn.org/stable/modules/array_api.html

scikit-learn Nightly Build 🌕

https://scikit-learn.org/dev/modules/array_api.html

Array API Standard 🔬

Array Libraries

Consortium for Python Data API Standards

https://data-apis.org

Array API Standard 🔬

https://data-apis.org/array-api/latest/API_specification/index.html

Extensions 🔌

Vision 🔮

NumPy Code

def func(x, y):
    out = np.mean(x, axis=0) - 2 * np.std(y, axis=0)
    return out

Vision 🔮

NumPy Code

def func(x, y):
    out = np.mean(x, axis=0) - 2 * np.std(y, axis=0)
    return out

Array API Code

def func(x, y):
    xp = array_namespace(x, y)
    out = xp.mean(x, axis=0) - 2 * xp.std(y, axis=0)
    return out

Array API support (2022)

✅

import numpy.array_api as xp
import cupy.array_api as xp

🛑

import numpy as np
import cupy as cp

scikit-learn v1.2 Array API support (2022)

import cupy
import cupy.array_api as xp
sklearn.set_config(array_api_dispatch=True)
X_cp, y_cp = cupy.asarray(...), cupy.asarray(...)
X_xp, y_xp = xp.asarray(X_cp), xp.asarray(y_cp)
lda = LinearDiscriminantAnalysis()
lda.fit(X_xp, y_xp)

Meta + Quansight Collaboration

`array_api_compat` 🚀

Extend Array API standard to the main namespace!

https://github.com/data-apis/array-api-compat

Using `array_api_compat` 🚀

from array_api_compat import array_namespace
def func(x, y):
    xp = array_namespace(x, y)
    out = xp.mean(x, axis=0) - 2 * xp.std(y, axis=0)
    return out

Using `array_api_compat` 🚀

from array_api_compat import array_namespace
def func(x, y):
    xp = array_namespace(x, y)
    out = xp.mean(x, axis=0) - 2 * xp.std(y, axis=0)
    return out

Works with 🎯

`array_api_compat` Extend:

NumPy's ndarray
CuPy's ndarray
PyTorch's Tensors

Array API implementations

Numpy Arrays from numpy.array_api
CuPy Arrays from cupy.array_api

scikit-learn v1.3 Array API support (2023)

import torch
sklearn.set_config(array_api_dispatch=True)
X_torch_cpu, y_torch_cpu = torch.asarray(...), torch.asarray(...)
lda = LinearDiscriminantAnalysis()
lda.fit(X_torch_cpu, y_torch_cpu)

Challenges 🚧

API Differences 🔌
Semantic Differences 🪄
Compiled Code 🏗️

API Differences 🔌

Most methods are in the module 📦

NumPy

import numpy as np
y_sum = y.sum(axis=0)

Most methods are in the module 📦

NumPy

import numpy as np
y_sum = y.sum(axis=0)

Array API

from array_api_compat import array_namespace
xp = array_namespace(y)
y_sum = xp.sum(y, axis=0)

Most methods are in the module 📦

NumPy

import numpy as np
y = (X.mean(axis=1) > 1.0).any()

Most methods are in the module 📦

NumPy

import numpy as np
y = (X.mean(axis=1) > 1.0).any()

Array API

xp = array_namespace(x)
y = xp.any(xp.mean(X, axis=1) > 1.0)

Matrix Multiplication 🧮

NumPy

import numpy as np
C = np.dot(A, B)

Matrix Multiplication 🧮

NumPy

import numpy as np
C = np.dot(A, B)

Array API

@ is more restrictive compared to np.dot

C = A @ B

Differences between NumPy and Array API 🎛️

NumPy

import numpy as np
uniques = np.unique(x)
uniques, counts = np.unique(x, return_counts=True)

Differences between NumPy and Array API 🎛️

NumPy

import numpy as np
uniques = np.unique(x)
uniques, counts = np.unique(x, return_counts=True)

Array API

xp = array_namespace(x)
uniques = xp.unique_values(x)
counts = xp.unique_counts(x)

Some NumPy API does not exist in Array API 🎚️

NumPy

import numpy as np
x_mean = np.nanmax(x, axis=1)

Some NumPy API does not exist in Array API 🎚️

Array API

def xp_nanmax(X, axis=None):
    xp = array_namespace(X)
    if is_numpy_namespace(xp):
        return xp.asarray(numpy.nanmax(X, axis=axis))
    # Implement using Array API standard (simplified)
    mask = xp.isnan(X)
    inf_ = xp.asarray(-xp.inf, device=device(X))
    X_nanmax = xp.max(xp.where(mask, inf_, X), axis=axis)
    return X_nanmax

https://github.com/data-apis/array-api/issues/621

Integer Indexing 🔎

NumPy

import numpy as np
x = np.asarray([[1, 2], [4, 5], [4, 1]])
x[[0, 2]]
# array([[1, 2],
#        [4, 1]])

Integer Indexing 🔎

NumPy

import numpy as np
x = np.asarray([[1, 2], [4, 5], [4, 1]])
x[[0, 2]]
# array([[1, 2],
#        [4, 1]])

Array API

Added in the 2022.12 standard

import numpy.array_api as xp
x = xp.asarray([[1, 2], [4, 5], [4, 1]])
xp.take(x, xp.asarray([0, 2]), axis=0)
# Array([[1, 2],
#        [4, 1]], dtype=int64)

Indexing Multiple Dimensions 🔎

NumPy

import numpy as np
x = np.asarray([[1, 2, 3], [4, 5, 6]])
x[1]
# array([4, 5, 6])

Indexing Multiple Dimensions 🔎

NumPy

import numpy as np
x = np.asarray([[1, 2, 3], [4, 5, 6]])
x[1]
# array([4, 5, 6])

Array API

import numpy.array_api as xp
x = xp.asarray([[1, 2, 3], [4, 5, 6]])
x[1]
# IndexError

Indexing Multiple Dimensions 🔎

NumPy

import numpy as np
x = np.asarray([[1, 2, 3], [4, 5, 6]])
x[1]
# array([4, 5, 6])

Array API

import numpy.array_api as xp
x = xp.asarray([[1, 2, 3], [4, 5, 6]])
x[1]
# IndexError

x[1, :]
# array([4, 5, 6])

Random Number Generators 🎮

NumPy

import numpy as np
rng = np.random.default_rng()
x = rng.standard_normal(size=10)

Random Number Generators 🎮

NumPy

import numpy as np
rng = np.random.default_rng()
x = rng.standard_normal(size=10)

Array API

import numpy as np
rng = np.random.default_rng()
x_np = rng.standard_normal(size=10)
xp = array_namespace(x)
x_xp = xp.asarray(x_np, device=device(x))

Order ♟️

rng = np.random.default_rng()
x = rng.standard_normal(size=(10_000, 10_000))
x_c = np.asarray(x, order="C")
x_f = np.asarray(x, order="F")
%%timeit
_ = x_c.sum(axis=0)
# 36.3 ms ± 1.44 ms per loop
%%timeit
_ = x_f.sum(axis=0)
# 18.8 ms ± 131 µs per loop

API Differences

https://numpy.org/doc/stable/reference/array_api.html

Semantic Differences 🪄

Type Promotion ♛

NumPy

import numpy as np
x1 = np.asarray([[1, 2], [4, 5]])
x2 = np.asarray([[1, 2]], dtype=np.float32)
x1 + x2
# array([[2., 4.],
#        [5., 7.]])

Type Promotion ♛

NumPy

import numpy as np
x1 = np.asarray([[1, 2], [4, 5]])
x2 = np.asarray([[1, 2]], dtype=np.float32)
x1 + x2
# array([[2., 4.],
#        [5., 7.]])

Array API

x1 = xp.asarray([[1, 2], [4, 5]])
x2 = xp.asarray([[1, 2]], dtype=xp.float32)
x1 + x2
# TypeError: int64 and float32 cannot be type promoted together

Type Promotion ♛

Workaround

x1 = xp.asarray([[1, 2], [4, 5]], dtype=xp.float32)
x2 = xp.asarray([[1, 2]], dtype=xp.float32)
x1 + x2
# Array([[2., 4.],
#        [5., 7.]], dtype=float32)

Type Promotion ♛: Python Scalars

NumPy

import numpy as np
x1 = np.asarray([[1, 2, 3]])
x2 = 1.0
x1 + x2
# array([[2., 3., 4.]])

Type Promotion ♛: Python Scalars

NumPy

import numpy as np
x1 = np.asarray([[1, 2, 3]])
x2 = 1.0
x1 + x2
# array([[2., 3., 4.]])

Array API

import numpy.array_api as xp
x1 = xp.asarray([[1, 2, 3]])
x2 = 1.0
x1 + x2
# TypeError: Python float scalars can only be promoted with floating-point arrays.

Type Promotion ♛: Python Scalars

Workaround

import numpy.array_api as xp
x1 = xp.asarray([[1, 2, 3]], dtype=xp.float32)
x2 = 1.0
x1 + x2
# Array([[2., 3., 4.]], dtype=float32)

Device 📠

NumPy

import numpy as np
y = np.linspace(2.0, 3.0, num=10)

Device 📠

NumPy

import numpy as np
y = np.linspace(2.0, 3.0, num=10)

Array API

from array_api_compat import device
xp = array_namespace(x)
y = xp.linspace(2.0, 3.0, num=10, device=device(x))

Compiled Code 🏗️

Complied Code in scikit-learn? 🏗️Random Forest 🌲🌲🌲RandomForestClassifier
RandomForestRegressor

Histogram Gradient Boosting 🎄 + 🛹HistGradientBoostingClassifier
HistGradientBoostingRegressor

Linear Models 📈LogisticRegression
PoissonRegressor

Possible SolutionsWorks Now 🪄Convert to NumPy and back - SciPy

Convert to NumPy and back - SciPy

def func(a, b):
    xp = array_namespace(a, b)
    c = xp.sum(a, axis=1) + xp.sum(b, axis=1)
    c = numpy.asarray(c)
    d = compiled_code_that_only_works_with_numpy(c)
    d = xp.asarray(d)
    return d

Possible Solutions

Works Now 🪄

Convert to NumPy and back - SciPy

Dispatching 🔀

uarray - SciPy
Plugins - Scikit-learn
Array library specific code

Dispatching 🔀

def func(a, b, plugin):
    xp = array_namespace(a, b)
    c = xp.sum(a, axis=1) + xp.sum(b, axis=1)
    d = plugin.dispatch_to_library(c)
    e = xp.mean(d, axis=0)
    return e

Array library specific code 📚

def erf(x):
    if is_numpy(x):
        import scipy.special
        return scipy.special.erf(x)
    elif is_cupy(x):
        import cupyx.scipy.special.erf
        import cupyx.scipy.special.erf(x)
    elif is_pytorch(x):
        import torch
        return torch.special.erf(x)
    else:
        ...

Challenges 🚧

API Differences 🔌
Semantic Differences 🪄
Compiled Code 🏗️

Why Adopt the Array API Standard?

Smaller API

Why Adopt the Array API Standard?

Smaller API

Portable

Why Adopt the Array API Standard?

Smaller API

Portable

Performance

Performance 🚀

16-core AMD 5950x CPU and Nvidia RTX 3090 GPU

SciPy

https://scipy.github.io/devdocs/dev/api-dev/array_api.html

Conclusion

User 🧪

Try scikit-learn's Array API feature support
- https://scikit-learn.org/stable/modules/array_api.html

Library Author ✍️

Try using the Array API standard
- Issue tracker: https://github.com/data-apis/array-api

Conclusion

User 🧪

Try scikit-learn's Array API feature support
- https://scikit-learn.org/stable/modules/array_api.html

Library Author ✍️

Try using the Array API standard
- Issue tracker: https://github.com/data-apis/array-api

Thomas J. Fan
@thomasjpfan github.com/thomasjpfan/pydata-nyc-2023-scikit-learn-array-api

↑, ←, Pg Up, k	Go to previous slide
↓, →, Pg Dn, Space, j	Go to next slide
Home	Go to first slide
End	Go to last slide
Number + Return	Go to specific slide
b / m / f	Toggle blackout / mirrored / fullscreen mode
c	Clone slideshow
p	Toggle presenter mode
t	Restart the presentation timer
?, h	Toggle this help

Scikit-learn on GPUs with Array API

GPU support in scikit-learn ⁉️

Historical Stance 📖

scikit-learn v1.2 Array API support (2022)

scikit-learn v1.3 Array API support (2023)

Contents

1. scikit-learn API 🖥️

2. Array API Standard 🔬

3. Challenges 🚧

scikit-learn API 🖥️

scikit-learn API 🖥️

Enabling Array API support in scikit-learn

Global configuration 🌎

Enabling Array API support in scikit-learn

Context Manager 🎬

Performance 🚀

Incremental Support (v1.3)

scikit-learn Nightly Build 🌕

Array API Standard 🔬

Array Libraries

Consortium for Python Data API Standards

Array API Standard 🔬

Extensions 🔌

Vision 🔮

NumPy Code

Vision 🔮

NumPy Code

Array API Code

Array API support (2022)

✅

🛑

scikit-learn v1.2 Array API support (2022)

Meta + Quansight Collaboration

array_api_compat 🚀

Extend Array API standard to the main namespace!

Using array_api_compat 🚀

Using array_api_compat 🚀

Works with 🎯

array_api_compat Extend:

Array API implementations

scikit-learn v1.3 Array API support (2023)

Challenges 🚧

Challenges 🚧

API Differences 🔌

Most methods are in the module 📦

NumPy

Most methods are in the module 📦

NumPy

Array API

Most methods are in the module 📦

NumPy

Most methods are in the module 📦

NumPy

Array API

Matrix Multiplication 🧮

NumPy

Matrix Multiplication 🧮

NumPy

Array API

Differences between NumPy and Array API 🎛️

NumPy

Differences between NumPy and Array API 🎛️

NumPy

Array API

Some NumPy API does not exist in Array API 🎚️

NumPy

Some NumPy API does not exist in Array API 🎚️

Array API

Integer Indexing 🔎

NumPy

Integer Indexing 🔎

NumPy

Array API

Indexing Multiple Dimensions 🔎

NumPy

Indexing Multiple Dimensions 🔎

NumPy

Array API

Indexing Multiple Dimensions 🔎

NumPy

`array_api_compat` 🚀

Using `array_api_compat` 🚀

Using `array_api_compat` 🚀

`array_api_compat` Extend: