Solution method

Here we describe our algorithmic approach to solving the routing problem:

\[\begin{array}{ll} \text{maximize} & U(\Psi) \\ \text{subject to} & \Psi = \sum_{i=1}^m A_i(\Lambda_i - \Delta_i) \\ & \phi_i(R_i + \gamma_i\Delta_i - \Lambda_i) = \phi_i(R_i), \quad i = 1, \dots, m \\ &\Delta_i \geq 0, \quad \Lambda_i \geq 0, \quad i = 1, \dots, m. \end{array}\]
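For example, if the global set of tokens is $\{1, 2, 3\}$ and CFMM $i$ trades only the pair $(1, 3)$, then $A_i$, the matrix that embeds CFMM $i$'s local token indices into the global ones, is

\[A_i = \begin{bmatrix} 1 & 0 \\ 0 & 0 \\ 0 & 1 \end{bmatrix},\]

so the local net trade $\Lambda_i - \Delta_i \in \mathbf{R}^2$ contributes only to the first and third entries of $\Psi$.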

Overview

Our approach is a common one in large-scale optimization: we decompose the problem so that the optimal trade on each individual CFMM can be found independently. It is easy to see that the only constraint coupling the CFMM trades together is

\[\Psi = \sum_{i=1}^m A_i(\Lambda_i - \Delta_i).\]

Therefore, if we can eliminate this constraint, the subproblems can be solved independently. Intuitively, our approach is to relax this constraint to a penalty in the objective, with some cost for violating it. If we fix these costs at a given iteration, the CFMM subproblems can be solved independently. We then use the subproblem solutions to update the cost of violation and iterate this process. The algorithmic details are explained below.
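The overall shape of the method can be sketched in a few lines of Julia. All names here (solve_arbitrage, U_conj_grad, the fields of the CFMM objects) are hypothetical placeholders rather than our solver's actual API, and the plain gradient step on the prices anticipates the gradient formula derived below:

```julia
# Sketch of the decomposition loop (hypothetical names, not the actual API).
# At fixed prices ν the m subproblems are independent; ν is then updated
# using the aggregate violation of the coupling constraint.
function route_sketch(cfmms, U_conj_grad, ν; iters=100, step=1e-2)
    for _ in 1:iters
        Ψ_sub = zero(ν)
        for c in cfmms  # independent subproblems: trivially parallelizable
            _, Δ, Λ = solve_arbitrage(c, c.A' * ν)  # optimal trade at prices Aᵢᵀν
            Ψ_sub += c.A * (Λ - Δ)
        end
        Ψ_star = U_conj_grad(ν)       # maximizer of U(Ψ) − νᵀΨ
        ν -= step * (Ψ_sub - Ψ_star)  # gradient step on the dual function
    end
    return ν
end
```

In practice the prices are updated with L-BFGS-B rather than a fixed-step gradient method, as described below.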

Partial dualization of the constraints

Motivated by the insight above, we form the (partial) Lagrangian

\[\mathcal{L}(\Psi, \Delta, \Lambda, \nu) = U(\Psi) - \nu^T(\Psi - \sum_{i=1}^m A_i(\Lambda_i - \Delta_i)) - \sum_{i=1}^m \mathbf{I}_i(\Delta_i, \Lambda_i),\]

where $\nu$ is a dual variable and

\[\mathbf{I}_i(\Delta_i, \Lambda_i) = \begin{cases} 0 & \Delta_i, \Lambda_i \geq 0, \text{ and } \phi_i(R_i + \gamma_i\Delta_i - \Lambda_i) \geq \phi_i(R_i) \\ \infty & \text{otherwise}. \end{cases}\]

In other words, $\mathbf{I}_i$ is an indicator function which is $0$ if the trade $(\Delta_i, \Lambda_i)$ satisfies the CFMM's constraints and is positive infinity otherwise. Note that the equality constraint $\phi_i(R_i + \gamma_i\Delta_i - \Lambda_i) = \phi_i(R_i)$ from the original problem has been relaxed to an inequality here; this makes the constraint set convex (for concave $\phi_i$), and since the utility $U$ is nondecreasing, the relaxation is tight at any solution.
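Membership in this set is easy to test numerically. A minimal sketch, assuming the trading function is available as a plain Julia function:

```julia
# True iff the indicator 𝐈ᵢ(Δ, Λ) is zero for a pool with reserves R,
# fee γ, and trading function φ: nonnegative trade vectors and a
# trading function value that does not decrease.
feasible(φ, R, γ, Δ, Λ) =
    all(Δ .>= 0) && all(Λ .>= 0) && φ(R + γ * Δ - Λ) >= φ(R)
```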

Notice that we only introduce a dual variable for the coupling constraint. The corresponding dual function, found by maximizing the Lagrangian over the primal variables, is

\[\begin{aligned} g(\nu) &= \sup_{\Psi, \Delta_i, \Lambda_i} \mathcal{L}(\Psi, \Delta, \Lambda, \nu) \\ &= \sup_\Psi \left(U(\Psi) - \nu^T\Psi\right) + \sum_{i=1}^m \sup_{\Delta_i, \Lambda_i}\left( (A_i^T\nu)^T(\Lambda_i - \Delta_i) -\mathbf{I}_i(\Delta_i, \Lambda_i) \right) \\ &= (-U)^*(-\nu) + \sum_{i=1}^m \mathbf{arb}_i(A_i^T\nu), \end{aligned}\]

where $\mathbf{arb}_i(A_i^T\nu)$ is the optimal arbitrage profit from CFMM $i$ when the external "market prices" are $A_i^T\nu$, and $(-U)^*$ is the Fenchel conjugate of $-U$.
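As a quick illustration of the conjugate term, take a linear utility $U(\Psi) = c^T\Psi$ (an illustrative special case, not a requirement of the method). Then

\[(-U)^*(-\nu) = \sup_\Psi \left(c - \nu\right)^T\Psi = \begin{cases} 0 & \nu = c \\ +\infty & \text{otherwise}, \end{cases}\]

so the dual variable is pinned at the prices $c$, and minimizing $g$ reduces to evaluating $\sum_{i=1}^m \mathbf{arb}_i(A_i^Tc)$: with a linear utility, routing decouples into independent arbitrage problems against fixed external prices.

The optimal arbitrage problem for each CFMM $i$ can be solved in parallel, as the problems are fully independent. Each corresponds to the following convex optimization problem: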

\[\begin{array}{ll} \text{maximize} & (A_i^T\nu)^T(\Lambda_i - \Delta_i)\\ \text{subject to} & \phi_i(R_i + \gamma_i\Delta_i - \Lambda_i) \geq \phi_i(R_i) \\ &\Delta_i \geq 0, \quad \Lambda_i \geq 0. \end{array}\]

The general arbitrage problem, for an arbitrary concave trading function $\phi_i$, can be efficiently solved with standard techniques (e.g., a primal-dual interior point method). On the other hand, for many common CFMMs (e.g., Uniswap v2) this problem has a closed-form solution (cf. Appendix A of [2]), which we exploit in our solver.
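For the constant product case, for example, the first-order optimality conditions can be solved by hand. Here is a minimal sketch for a two-coin constant product pool ($\phi(R) = R_1R_2$) with fee $\gamma$ and external prices $p = A_i^T\nu$; the variable names are ours, and the derivation is the one in Appendix A of [2]:

```julia
# Closed-form arbitrage for a two-coin constant product pool φ(R) = R₁R₂.
# Tendering Δ₁ of token 1 yields Λ₂ = R₂γΔ₁/(R₁ + γΔ₁); maximizing
# p₂Λ₂ − p₁Δ₁ over Δ₁ ≥ 0 gives the clipped square-root formula below.
function arb_product_pool(R, γ, p)
    Δ = zeros(2); Λ = zeros(2)
    # Direction 1: tender token 1, receive token 2.
    Δ[1] = max((sqrt(γ * (p[2] / p[1]) * R[1] * R[2]) - R[1]) / γ, 0)
    Λ[2] = R[2] * γ * Δ[1] / (R[1] + γ * Δ[1])
    # Direction 2: tender token 2, receive token 1 (symmetric).
    Δ[2] = max((sqrt(γ * (p[1] / p[2]) * R[1] * R[2]) - R[2]) / γ, 0)
    Λ[1] = R[1] * γ * Δ[2] / (R[2] + γ * Δ[2])
    return Δ, Λ
end
```

For $\gamma \leq 1$ at most one of the two directions can be profitable, so clipping each input at zero is enough to recover the no-trade case.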

Minimizing the dual problem

As a result of strong duality, minimizing $g(\nu)$ is equivalent to solving the original problem, in that the optimal values are equal. After minimizing $g(\nu)$, we can reconstruct the trade using the $\Delta_i$'s and $\Lambda_i$'s found in the subproblems.

To minimize $g(\nu)$, we use L-BFGS-B [3], which requires evaluation of $g(\nu)$ and $\nabla g(\nu)$. Note that the gradient of the Fenchel conjugate, $\nabla f^*(y)$, is the $x$ at which the supremum $\sup_x (y^Tx - f(x))$ is attained (this may not be a unique point but a set of points, called the subdifferential, in which case we simply choose any $x$ in this set).

The gradient of each arbitrage term is $\nabla_\nu \mathbf{arb}_i(A_i^T\nu) = A_i (\Lambda_i^* - \Delta_i^*)$, where $(\Delta_i^*, \Lambda_i^*)$ is an optimal trade for the arbitrage problem $\mathbf{arb}_i(A_i^T\nu)$. Thus,

\[\nabla g(\nu) = -\Psi^* + \sum_{i=1}^m A_i (\Lambda_i^* - \Delta_i^*),\]

where $\Psi^*$ is the point at which the supremum defining $(-U)^*(-\nu)$ is attained, i.e., the maximizer of $U(\Psi) - \nu^T\Psi$. All of these maximizers are computed in the course of evaluating $g(\nu)$, so we get the gradient essentially for free.
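Concretely, each subproblem solve can return both its optimal value (a term of $g$) and its maximizing trade (a term of $\nabla g$). A sketch in the same hypothetical notation as above, where U_conj(ν) evaluates $(-U)^*(-\nu)$ and U_conj_grad(ν) returns the maximizer $\Psi^*$:

```julia
# Evaluate g(ν) and ∇g(ν) in one pass over the CFMMs: every subproblem
# contributes its optimal value to g and its maximizer to the gradient.
function dual_obj_and_grad(cfmms, U_conj, U_conj_grad, ν)
    g  = U_conj(ν)         # (−U)*(−ν)
    ∇g = -U_conj_grad(ν)   # −Ψ*
    for c in cfmms
        profit, Δ, Λ = solve_arbitrage(c, c.A' * ν)
        g  += profit
        ∇g += c.A * (Λ - Δ)
    end
    return g, ∇g
end
```

This pair is exactly what a quasi-Newton method such as L-BFGS-B consumes at each iteration.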

Extensions and notes

In the near future, we will also support user-created CFMMs, which are specified by the trading function $\phi$, its gradient $\nabla \phi$, and its Hessian $\nabla^2\phi$. The gradient and Hessian may be specified exactly or computed using automatic differentiation tools such as ForwardDiff.jl [4]. This method could also be extended to include gas fees and uncertain transaction execution (probabilistic constraints).
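For example, a user could implement $\phi$ as a plain Julia function and let ForwardDiff.jl supply the derivatives. A small sketch (the weighted geometric mean of a Balancer-style pool is just an illustrative choice of $\phi$):

```julia
using ForwardDiff

# Example user-defined trading function: weighted geometric mean,
# φ(R) = R₁^w₁ · R₂^w₂, as in a Balancer-style two-coin pool.
w = [0.8, 0.2]
φ(R) = prod(R .^ w)

R = [100.0, 400.0]
∇φ = ForwardDiff.gradient(φ, R)  # gradient of φ at the current reserves
Hφ = ForwardDiff.hessian(φ, R)   # Hessian of φ at the current reserves
```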

When using this method with real-world data, numerical issues (caused by the magnitude of the numbers involved and the order-of-magnitude differences between them) may need to be carefully addressed via appropriate scaling. Additionally, the dual decomposition technique used here will not necessarily yield a feasible $\Psi$ when the trading function $\phi$ is not strictly concave [5] (e.g., for a constant sum CFMM). In this case, extra care needs to be taken during implementation.

References