Gehan Zheng's blog

使用图着色(graph coloring)加速Gauss-Seidel迭代

2025-01-02T11:41:24.000Z

Jacobi和Gauss-Seidel都是用于求解线性方程组的迭代方法。Jacobi迭代很容易实现在GPU上并行的版本，但是收敛速度很慢甚至有可能发散。Gauss-Seidel迭代并不能用类似Jacobi的方法并行，因为它是一个依赖于顺序的迭代方法。但是Gauss-Seidel迭代的收敛速度比Jacobi快很多。因此出现了用图论的图着色算法加速Gauss-Seidel迭代的研究。

Hermite Interpolation

2024-10-30T04:52:19.000Z

Hermite插值多项式

Hermite插值是一种插值方法，可以通过给定的点和导数值构造插值多项式。给定点\((x_0,y_0),(x_1,y_1),...,(x_n,y_n)\)和导数值\(y'_0,y'_1,...,y'_n\)，可以构造插值多项式。本文介绍如何使用Newton差商生成Hermite插值多项式。

Point triangle distance

2024-09-15T06:17:09.000Z

介绍

该文章中的c++代码参考了ipc-toolkit的实现。

点到三角形的距离可以用如下定义：

\[ \begin{aligned} \text{distance}(\vec{\mathbf{x_p}}, \vec{\mathbf{x_t}_1}, \vec{\mathbf{x_t}_2}, \vec{\mathbf{x_t}_3}) &= \min_{\beta_1, \beta_2} \left\| \vec{\mathbf{x_p}} - ( \vec{\mathbf{x_t}_1} + \beta_1 (\vec{\mathbf{x_t}_2} - \vec{\mathbf{x_t}_1}) + \beta_2 (\vec{\mathbf{x_t}_3} - \vec{\mathbf{x_t}_1}) ) \right\| \\&s.t. \beta_1 \geq 0, \beta_2 \geq 0, \beta_1 + \beta_2 \leq 1 \end{aligned} \]

这是一个分断连续的函数，实际计算时可以根据点和三角形的位置关系分以下几种情况讨论，首先要将点投影到三角形所在的平面上：

投影后，点在三角形内部，此时距离为点到三角形所在平面的距离。
投影后，点在三角形的某个边朝外的半平面且投影在边上的点在边上，此时距离为点到边的距离。
其他情况，此时距离为点到三角形的三个顶点的最小距离。

Stress hessian computation in FEM

2024-08-08T10:05:16.000Z

介绍

在进行软体模拟时，如果使用牛顿法计算最优的下降方向，需要计算能量密度函数\(\Psi\)关于位置\(\vec {\mathbf{x}}\) 的Hessian矩阵，即\(\frac{\partial^2 \Psi}{\partial \vec {\mathbf{x}}^2}\)。其中\(\vec {\mathbf{x}}\)是一个四面体的四个顶点的位置。

\[ \begin{equation} \frac{\partial^2 \Psi}{\partial {\mathbf{x}^2}} = \text{vec}(\frac{\partial F}{\partial {\mathbf{x}}})^T \text{vec}(\frac{\partial P}{\partial F}) \text{vec}(\frac{\partial F}{\partial {\mathbf{x}}}) \end{equation} \]

Force computation in FEM

2024-05-02T14:43:25.000Z

证明1

首先，已知

\[ \vec {\mathbf{x}} = \begin{bmatrix} \vec x_1 & \vec x_2 & \vec x_3 \end{bmatrix}\\ D_s = \begin{bmatrix} \vec x_1-\vec x_4 & \vec x_2-\vec x_4 & \vec x_3-\vec x_4 \end{bmatrix}\\ \begin{aligned} &\frac{\partial (D_s)_{kl}}{\partial \vec {\mathbf{x}_{ij}}} e_i\otimes e_j\otimes e_k\otimes e_l\\ &=\frac{\partial \vec {\mathbf{x}}_{kl}}{\partial \vec {\mathbf{x}}_{ij}} e_i\otimes e_j\otimes e_k\otimes e_l\\ &= \delta_{ik}\delta_{jl} e_i\otimes e_j\otimes e_k\otimes e_l \end{aligned} \] 这里把\(\vec {\mathbf{x}}\)后三列写成一个3x3的矩阵。\(D_m^{-1}\)的分量表示为\(d_{mn}\)，\(P\)的分量表示为\(P_{rs}\)，则能量密度函数\(\Psi\)关于位置\(\vec {\mathbf{x}}\) 的梯度为：

SVD

2023-12-24T04:29:22.000Z

Understading the Geometry of \(A\in \mathbb{R}^{m\times n}\)

\[ A\in \mathbb{R}^{m\times n} \\ x\in \mathbb{R}^n \mapsto Ax \in \mathbb{R}^m \]

QR iteration

2023-12-23T05:14:20.000Z

Assumption: \(A \in \mathbb{R}^{n \times n}\) is symmetric, no repeated eigenvalues \(\lambda\)s.

Eigenvalue iteration, deflation

2023-12-23T05:14:17.000Z

Two basic Properties

Lemma: Every matrix \(A \in \mathbb{R}^{n \times n}\) has at least one (complex) eigenvalue.

Applications of eigenvectors

2023-12-22T05:14:13.000Z

很多时候需要把已有问题转化为特征值问题。

有一个对称矩阵（有时候是对称半正定矩阵），我们想要找一个方便的基。
具体的优化问题。

Column Space QR

2023-12-21T02:46:03.000Z

High-Level Idea

Why QR?

\[ \begin{aligned} cond A^T A &= ||A^T A|| \cdot ||(A^T A)^{-1}||\\ &\approx ||A^T|| \cdot ||A|| \cdot ||A^{-1}|| \cdot ||A^{-T}|| &= cond A^2 \end{aligned} \]

为了避免计算\(A^T A\)，我们可以使用QR分解。

Projective Dynamics 的local step实现推导

2023-12-20T08:08:06.000Z

介绍

Projective Dynamics是一种用于软体模拟的方法，算法分为local step和global step两个部分。其中local step可以对于每个四面体约束并行计算，global step只需要求解一个线性方程组，而他的矩阵非常特殊，是一个Gram矩阵，因此可以预先用Cholesky分解。本文主要介绍local step的实现的推导过程。用Corotated strain model的CUDA的实现作为例子。

Condition number for linear systems

2023-12-20T04:06:16.000Z

Sensitivity and Conditioning

Gaussian elimination works in theory, but what about floating point precision?

How much can we trust \(x_0\) if \(0<||Ax_0 - b||_2 \ll 1\)(backwards error)?

Cholesky factorization, sparse matrices

2023-12-17T10:23:19.000Z

Structured Linear Systems

Gaussian elimination and/or LU can solve all the example problems above. But these systems can have special properties that make them easier or stabler to solve.

Today's example: Positive definite, sparsity.

Linear Systems and LU

2023-12-17T04:50:28.000Z

Linear Systems

\[ \begin{aligned} Ax &= b \\ A &\in \mathbb{R}^{n \times n} \\ x &\in \mathbb{R}^n \\ b &\in \mathbb{R}^n \end{aligned} \]

Introduction, number systems, measuring error

2023-12-16T07:47:52.000Z

Mathematically correct != numerically sound. Using Tolerance:

double x = 1.0;
double y = x / 3.0;
if(fabs(x-y*3.0) < numeric_limits<double>::epsilon()){
    cout << "They are equal" << endl;
}
else
    cout << "They are not equal" << endl;

对LLVM的Pass进行调试的vscode配置

2023-11-20T04:26:12.000Z

文件夹结构

安装LLVM的过程省去，假设当前目录下有:

CMakeLists.txt文件；编译pass
include文件夹：pass的头文件
src文件夹；pass的源码
test文件夹：用于静态分析pass的测试代码（.c文件）

张量

2023-11-19T15:32:17.000Z

对于三维空间中的各向同性材料，弹性模量张量（Elastic Modulus Tensor）可以视为一个\(3 \times 3\)矩阵，其中每个元素本身又是一个 \(3 \times 3\) 的矩阵。这样的表示反映了张量的四阶本质，即它有四个索引。

GAMES103:刚体模拟笔记

2023-04-09T09:50:21.000Z

2 Math

Tetrahedral Volume

VkPresentModeKHR-呈现模式

2023-03-07T06:15:11.000Z

在构建交换链 vk::SwapchainCreateInfoKHR时，有一个设置是 VkPresentModeKHR。根据官方手册VkPresentModeKHR(3) Manual Page，呈现模式有6种。

Bspline

2023-03-04T14:13:56.000Z