1.1 Background on Riemannian optimization, privacy, and JAX
Riemannian optimization. Riemannian optimization [5, 21] considers the following problem
$\min_{w \in \mathcal{M}} f(w), \qquad (1)$

where $f : \mathcal{M} \rightarrow \mathbb{R}$ and $\mathcal{M}$ denotes a Riemannian manifold. Instead of considering (1) as a
constrained problem, Riemannian optimization [5, 21] views it as an unconstrained problem on the
manifold. Riemannian (stochastic) gradient descent [112, 20] generalizes Euclidean gradient descent
with intrinsic updates on the manifold, i.e., $w_{t+1} = \mathrm{Exp}_{w_t}(-\eta_t \, \mathrm{grad} f(w_t))$, where $\mathrm{grad} f(w_t)$ is
the Riemannian (stochastic) gradient, $\mathrm{Exp}_{w}(\cdot)$ is the Riemannian exponential map at $w$, and $\eta_t$ is
the step size. Recent years have witnessed significant advancements in Riemannian optimization,
with increasingly sophisticated solvers generalized from the Euclidean space to Riemannian manifolds.
These include variance reduction methods [111, 96, 65, 114, 48, 47], adaptive gradient methods
[15, 64], accelerated gradient methods [50, 76, 7, 113, 8], quasi-Newton methods [58, 89], zeroth-order
methods [75], and second-order methods such as trust region methods [4] and cubic regularized
Newton’s methods [6].
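To make the update rule above concrete, the following is a minimal JAX sketch of Riemannian gradient descent on the unit sphere; the Rayleigh-quotient objective and the helper names project_to_tangent, sphere_exp, and rgd_step are illustrative assumptions for this sketch, not part of any library discussed in this paper.

import jax
import jax.numpy as jnp

def f(w, A):
    # Toy smooth objective on the sphere (negative Rayleigh quotient).
    return -w @ A @ w

def project_to_tangent(w, g):
    # Riemannian gradient on the unit sphere: project the Euclidean gradient
    # onto the tangent space T_w = {v : <v, w> = 0}.
    return g - (g @ w) * w

def sphere_exp(w, v):
    # Exponential map on the sphere: Exp_w(v) = cos(||v||) w + sin(||v||) v / ||v||.
    n = jnp.linalg.norm(v)
    return jnp.cos(n) * w + jnp.sin(n) * v / jnp.maximum(n, 1e-12)

@jax.jit
def rgd_step(w, A, step_size):
    # One step of w_{t+1} = Exp_{w_t}(-eta_t * grad f(w_t)).
    egrad = jax.grad(f)(w, A)
    rgrad = project_to_tangent(w, egrad)
    return sphere_exp(w, -step_size * rgrad)

# Usage (illustrative): approximate the leading eigenvector of a random symmetric matrix.
key = jax.random.PRNGKey(0)
A = jax.random.normal(key, (5, 5))
A = (A + A.T) / 2.0
w = jnp.ones(5) / jnp.sqrt(5.0)
for _ in range(200):
    w = rgd_step(w, A, 0.1)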
Differential privacy on Riemannian manifolds.
Differential privacy (DP) provides a rigorous
treatment for data privacy by precisely quantifying the deviation in the model’s output distribution
under modification of a small number of data points [34, 33, 32, 35]. Provable guarantees of DP
coupled with properties like immunity to arbitrary post-processing and graceful composability have
made it a de-facto standard of privacy, with steadfast adoption in real-world applications [37, 10, 31, 83, 3].
Further, it has been shown empirically that DP models resist various kinds of leakage attacks that
can cause privacy violations [91, 26, 95, 115, 13].
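For reference, the deviation referred to above is typically formalized by the standard $(\varepsilon, \delta)$-definition of differential privacy (stated here in its textbook form, not quoted from the cited works): a randomized mechanism $\mathcal{M}$ is $(\varepsilon, \delta)$-differentially private if, for every pair of datasets $D$ and $D'$ differing in a single record and every measurable set $S$ of outputs,

$\Pr[\mathcal{M}(D) \in S] \le e^{\varepsilon} \, \Pr[\mathcal{M}(D') \in S] + \delta.$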
Recently, there has been a surge of interest in differential privacy over Riemannian manifolds, which
has been explored in the context of Fréchet mean [39] computation [92, 109] and, more generally,
empirical risk minimization problems where the parameters are constrained to lie on a Riemannian
manifold [49].
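For concreteness (a standard definition, not one quoted from the cited works), the Fréchet mean of points $x_1, \ldots, x_n \in \mathcal{M}$ generalizes the Euclidean average by minimizing the sum of squared geodesic distances,

$\bar{w} \in \arg\min_{w \in \mathcal{M}} \sum_{i=1}^{n} d^2(w, x_i),$

where $d(\cdot, \cdot)$ denotes the geodesic distance on $\mathcal{M}$.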
JAX and its ecosystem.
JAX [41, 24] is a recently introduced machine learning framework which supports automatic
differentiation [14] via grad(). Further, some of the distinguishing features of JAX are just-in-time
(JIT) compilation using the accelerated linear algebra (XLA) compiler [46] via jit(), automatic
vectorization (batch-level parallelism) via vmap(), and strong support for parallel computation via
pmap(). All of these transformations can be composed arbitrarily because JAX follows the functional
programming paradigm and implements them as pure functions.
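As a small, self-contained illustration of this composability (a sketch with a toy quadratic loss, not an example taken from the paper), the following composes grad(), vmap(), and jit() into a single batched, compiled gradient function.

import jax
import jax.numpy as jnp

def loss(w, x):
    # Toy quadratic loss, chosen only to illustrate the transformations.
    return jnp.sum((x - w) ** 2)

# grad() differentiates loss with respect to its first argument (w),
# vmap() vectorizes the gradient over a batch of inputs x,
# and jit() compiles the composed function with XLA.
batched_grads = jax.jit(jax.vmap(jax.grad(loss), in_axes=(None, 0)))

w = jnp.zeros(3)
xs = jnp.arange(12.0).reshape(4, 3)  # a batch of 4 inputs
print(batched_grads(w, xs).shape)    # (4, 3): one gradient per batch element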
Given these features, the JAX ecosystem has been expanding steadily over the last couple of years.
Examples include neural network modules (Flax [54], Haiku [56], Equinox [69], Jraph [44],
Equivariant-MLP [38]), reinforcement learning agents (Rlax [12]), Euclidean optimization
algorithms (Optax [12]), federated learning (Fedjax [93]), optimal transport toolboxes (Ott [30]),
sampling algorithms (Blackjax [71]), differential equation solvers (Diffrax [68]), rigid body simulators
(Brax [40]), and differentiable physics (Jax-md [97]), among others.