Fiber Bundles, Connections, and Curvature

Fiber bundles, vector bundles, principal bundles, connections, curvature, characteristic classes, and Berry bundles.

Mathematics / Fiber Bundles / Bundles, connections, and curvature

Fiber Bundles notes
Sections

Reading Guide

This is the comprehensive fiber-bundle note. It keeps the long-form route from local triviality to vector bundles, principal bundles, connections, curvature, characteristic classes, Berry curvature, and Bloch bundles.

For shorter reads, use the neighboring section notes:

  • Topological Fiber Bundles
  • Vector Bundles, Frames, and Sections
  • Principal Bundles and Connections
  • Chern Classes and Bloch Bundles

The shorter pages are intended as polished entry points. This page remains useful as the full continuity-preserving version.

Purpose of these notes These notes are deliberately slow. They assume you have seen manifolds, tangent vectors, differential forms, pullbacks, pushforwards, flows, and perhaps Lie derivatives before, but may not remember their formal definitions. The goal is not to replace a full course in differential geometry. The goal is to make the following sentences precise and usable: \[\begin{gather*} \text{a vector field is a section of } TM,\\ \text{a one-form is a section of } T^*M,\\ \text{a vector bundle is a special kind of fibre bundle,}\\ \text{a gauge field is a local expression of a connection,}\\ \text{Berry curvature is curvature of the Bloch bundle.} \end{gather*}\] The note begins with topology and manifolds, then builds fibre bundles formally, then specializes to vector bundles, Lie groups, principal bundles, connections, curvature, cohomology, and first Chern numbers.

Scope on a first pass You do not need Chern–Simons theory, anomaly inflow, index theorems, or full algebraic topology to understand the basic fibre-bundle language. These topics are later applications. For a first pass, the essential chain is \[\begin{align*} \text{topological space} &\longrightarrow \text{manifold} \longrightarrow \text{tangent/cotangent spaces} \\ &\longrightarrow \text{forms and pullback} \longrightarrow \text{fibre bundles} \\ &\longrightarrow \text{vector bundles} \longrightarrow \text{connections} \\ &\longrightarrow \text{curvature} \longrightarrow \text{cohomology} \longrightarrow \text{Chern number}. \end{align*}\]

The Big Picture

A fibre bundle is a mathematical device for describing the following situation:

At each point of some base space, another space is attached. Locally the whole object looks like a product, but globally the attachment may be twisted.

The basic notation is \[\pi:E\to B.\] Here:

  • \(B\) is the base space. In physics this may be real space, spacetime, parameter space, or momentum space.

  • \(E\) is the total space. It contains all the attached fibres.

  • \(\pi\) is the projection map. It tells you which base point a point of \(E\) lies above.

  • The fibre over \(b\in B\) is \[E_b:=\pi^{-1}(b).\]

One-sentence version A fibre bundle is locally a product \(U\times F\), but it may fail to be globally a product \(B\times F\).

The main hierarchy is this: \[\boxed{\text{topological fibre bundle}} \quad \supset \quad \boxed{\text{vector bundle}}\] Vector bundles are fibre bundles whose fibres are vector spaces and whose transition functions are linear. Principal bundles are another important class of fibre bundles, where the fibre is a group \(G\) and the bundle describes gauge frames.

Physical language Bundle language
Vector field on a manifold Section of the tangent bundle \(TM\to M\)
One-form such as \(A=A_\mu dx^\mu\) Section of the cotangent bundle \(T^*M\to M\)
Electromagnetic vector potential Local expression of a connection on a \(U(1)\) bundle
Berry connection Connection on a Bloch vector bundle
Berry curvature Curvature of that connection
Chern number Integral of a characteristic class
Bloch occupied states \(\{|u_a(k)\rangle\}\) Local frame for a vector bundle over the Brillouin zone
Gauge transformation Change of local trivialization/frame

The note is written to prevent a common problem: if one jumps directly to Berry bundles or Chern numbers, the words “base space,” “fibre,” “section,” and “connection” can feel like labels rather than definitions. We therefore build them in order.

Topological Fibre Bundles

The formal definition

We now define fibre bundles formally, before specializing to vector bundles.

Definition 11 (Topological fibre bundle). A topological fibre bundle consists of topological spaces \(E\), \(B\), and \(F\), together with a continuous surjective map \[\pi:E\to B,\] such that there exists an open cover \(\{U_i\}_{i\in I}\) of \(B\) and homeomorphisms \[\phi_i:\pi^{-1}(U_i)\to U_i\times F\] satisfying \[\mathrm{pr}_1\circ \phi_i=\pi\] on \(\pi^{-1}(U_i)\). Here \(\mathrm{pr}_1:U_i\times F\to U_i\) is projection onto the first factor.

The homeomorphism \(\phi_i\) is called a local trivialization. The condition \[\mathrm{pr}_1\circ \phi_i=\pi\] means that the trivialization respects the base point: points lying over \(b\in U_i\) are identified with points in \(\{b\}\times F\).

The fibre over \(b\in B\) is \[E_b=\pi^{-1}(b).\] The local trivialization identifies it with \(F\): \[E_b\cong F.\]

What “locally product” means The definition does not say that \(E\) is globally \(B\times F\). It says that there exists at least one open cover \(\{U_i\}\) such that each restricted piece \(\pi^{-1}(U_i)\) is a product \(U_i\times F\).

Does every open cover need to trivialize the bundle?

No. The definition requires the existence of a trivializing open cover. Equivalently, for every \(b\in B\), there must exist some open neighbourhood \(U\) of \(b\) such that \[\pi^{-1}(U)\cong U\times F.\] It is not required that every open set \(U\subset B\) trivialize the bundle.

However, if \(U_i\) is a trivializing open set and \(V\subset U_i\) is a smaller open set, then the restriction also trivializes: \[\pi^{-1}(V)\cong V\times F.\] Thus any refinement of a trivializing cover is again a trivializing cover.

Möbius strip The Möbius strip is locally \(U\times\mathbb{R}\) over small arcs \(U\subset S^1\), but it is not globally \(S^1\times\mathbb{R}\). Therefore the open set \(U=S^1\) itself does not trivialize the Möbius bundle.

Transition functions

On an overlap \(U_i\cap U_j\), we have two trivializations: \[\phi_i:\pi^{-1}(U_i\cap U_j)\to (U_i\cap U_j)\times F,\] \[\phi_j:\pi^{-1}(U_i\cap U_j)\to (U_i\cap U_j)\times F.\] The change of trivialization is \[\phi_i\circ\phi_j^{-1}:(U_i\cap U_j)\times F\to (U_i\cap U_j)\times F.\] Because both trivializations preserve the base point, this map has the form \[\phi_i\circ\phi_j^{-1}(b,f)=(b,\tau_{ij}(b)(f)),\] where \[\tau_{ij}(b):F\to F\] is a homeomorphism of the fibre.

If one chooses a structure group \(G\subset \mathop{\mathrm{Aut}}(F)\) acting on \(F\), then transition functions are maps \[g_{ij}:U_i\cap U_j\to G\] such that \[\phi_i\circ\phi_j^{-1}(b,f)=(b,g_{ij}(b)\cdot f).\]

On triple overlaps \(U_i\cap U_j\cap U_k\), consistency gives \[g_{ij}g_{jk}=g_{ik}.\] Also, \[g_{ii}=1, \qquad g_{ji}=g_{ij}^{-1}.\]

Local data and global object A fibre bundle can be described by local products \(U_i\times F\) plus transition functions on overlaps. The nontrivial global information is contained in how the local products are glued.

Constructing a bundle from transition functions

Conversely, suppose we are given:

  • a base space \(B\) with open cover \(\{U_i\}\),

  • a fibre \(F\),

  • transition functions \(g_{ij}:U_i\cap U_j\to G\subset\mathop{\mathrm{Aut}}(F)\) satisfying \(g_{ij}g_{jk}=g_{ik}\).

Then one can construct the total space by gluing: \[E=\left(\bigsqcup_i U_i\times F\right)/\sim,\] where \[(b,f)_j\sim (b,g_{ij}(b)\cdot f)_i\] for \(b\in U_i\cap U_j\).

This construction is important because it shows that the total space \(E\) is not usually guessed first as a product. It is often built by gluing local products.

Smooth fibre bundles

If \(E\), \(B\), and \(F\) are smooth manifolds, a smooth fibre bundle is defined similarly, with all relevant maps smooth and local trivializations diffeomorphisms: \[\phi_i:\pi^{-1}(U_i)\to U_i\times F.\] The projection \(\pi:E\to B\) is then a smooth map, and locally it looks like projection \[U_i\times F\to U_i.\]

Base space assumptions The base \(B\) of a topological fibre bundle can be a general topological space. In differential geometry and physics, \(B\) is usually a smooth manifold. Compactness is not required. If \(B\) is compact, one can often choose finite subcovers; if it is not compact, the definitions still work.

Sections of a fibre bundle

Let \(\pi:E\to B\) be any fibre bundle. A section is a map \[s:B\to E\] such that \[\pi\circ s=\mathrm{id}_B.\] This means \[s(b)\in E_b\] for each \(b\in B\).

A section chooses one point in each fibre. When the fibre is a vector space, sections are fields. When the fibre is a group or frame space, sections are gauge choices or frame choices.

Vector Bundles From Scratch

The problem vector bundles solve

In ordinary linear algebra, we work with one fixed vector space \(V\). But on a manifold, the natural vector space may depend on the point. For example, at each point \(p\in M\) there is a tangent space \(T_pM\). These spaces are all isomorphic to \(\mathbb{R}^n\), but there is usually no canonical way to identify \(T_pM\) with \(T_qM\) for two different points \(p,q\in M\).

A vector bundle is a controlled way of saying:

There is a vector space attached to every point of the base space, and these vector spaces vary continuously or smoothly from point to point.

Formal definition

Definition 12 (Real vector bundle). A rank-\(r\) real vector bundle over a topological space \(B\) is a topological fibre bundle \[\pi:E\to B\] with fibre \(\mathbb{R}^r\), such that:

  1. Each fibre \(E_b=\pi^{-1}(b)\) is an \(r\)-dimensional real vector space.

  2. There exists an open cover \(\{U_i\}\) of \(B\) and local trivializations \[\phi_i:\pi^{-1}(U_i)\to U_i\times\mathbb{R}^r\] that restrict on each fibre \(E_b\) to a linear isomorphism \[E_b\to \{b\}\times\mathbb{R}^r\cong\mathbb{R}^r.\]

  3. On overlaps, transition functions take values in \(\mathop{\mathrm{GL}}(r,\mathbb{R})\).

A complex vector bundle is the same definition with \(\mathbb{C}^r\) and \(\mathop{\mathrm{GL}}(r,\mathbb{C})\).

If \(B=M\) is a smooth manifold and \(E\) is also a smooth manifold, then a smooth vector bundle requires smooth local trivializations and smooth transition functions \[g_{ij}:U_i\cap U_j\to \mathop{\mathrm{GL}}(r,\mathbb{R})\] or \[g_{ij}:U_i\cap U_j\to \mathop{\mathrm{GL}}(r,\mathbb{C}).\]

Definition 13 (Line bundle). A rank-\(1\) vector bundle is called a line bundle. A rank-\(1\) real vector bundle is a real line bundle, with fibre \(\mathbb{R}\). A rank-\(1\) complex vector bundle is a complex line bundle, with fibre \(\mathbb{C}\).

For a real line bundle, the transition functions take values in \[\mathop{\mathrm{GL}}(1,\mathbb{R})=\mathbb{R}^\times,\] the nonzero real numbers under multiplication. If a fibre metric is chosen and local frames are normalized, the structure group reduces to \[O(1)=\{+1,-1\}.\] This is why the Möbius strip can be described using transition functions equal to \(+1\) or \(-1\).

Is \(E\) arbitrary or equal to \(M\times\mathbb{R}^{\dim M}\)? The total space \(E\) is not assumed to be \(M\times\mathbb{R}^{\dim M}\). It is an independent space equipped with a projection \(\pi:E\to M\) and local product structures. Only the trivial rank-\(r\) bundle is globally \(M\times\mathbb{R}^r\). The rank \(r\) is independent of \(\dim M\) except in special examples such as the tangent bundle, where \(r=\dim M\).

If \(M\) is an \(n\)-dimensional smooth manifold and \(E\to M\) is a rank-\(r\) real smooth vector bundle, then the total space \(E\) is locally modeled on \[U\times\mathbb{R}^r\subset \mathbb{R}^n\times\mathbb{R}^r,\] so \(E\) is an \((n+r)\)-dimensional smooth manifold. For the tangent bundle \(TM\to M\), \(r=n\), so \(TM\) has dimension \(2n\).

The trivial bundle

The simplest vector bundle is the product \[E=M\times\mathbb{R}^r.\] The projection is \[\pi(p,v)=p.\] The fibre over \(p\) is \[E_p=\{p\}\times\mathbb{R}^r\cong\mathbb{R}^r.\] This is called the trivial rank-\(r\) bundle.

A bundle is trivial if there exists a global trivialization \[E\cong M\times\mathbb{R}^r\] compatible with the projection and vector-space structures. Many bundles are locally trivial but not globally trivial.

Local frames

Let \(E\to M\) be a rank-\(r\) vector bundle. On an open set \(U\subset M\), a local frame is a collection of \(r\) local sections \[e_1,\ldots,e_r\] such that for every \(p\in U\), the vectors \[e_1(p),\ldots,e_r(p)\] form a basis of \(E_p\).

Choosing a local frame is equivalent to choosing a local trivialization. If a section \(s\) is defined on \(U\), then \[s(p)=e_a(p)\psi^a(p).\] The functions \(\psi^a\) are the local components of \(s\).

Transition functions for vector bundles

Suppose we have two patches \(U_i\) and \(U_j\) with local frames \[e^{(i)}_1,\ldots,e^{(i)}_r, \qquad e^{(j)}_1,\ldots,e^{(j)}_r.\] On the overlap \(U_i\cap U_j\), both frames are valid. They must be related by an invertible matrix: \[e_a^{(j)}=e_b^{(i)}\,g_{ij}^{ba}.\] In matrix notation, \[e^{(j)}=e^{(i)}g_{ij}.\] The map \[g_{ij}:U_i\cap U_j\to \mathop{\mathrm{GL}}(r,\mathbb{R})\] is called a transition function. For a complex vector bundle, \[g_{ij}:U_i\cap U_j\to \mathop{\mathrm{GL}}(r,\mathbb{C}).\] If the bundle has a Hermitian metric and we choose orthonormal frames, then \[g_{ij}:U_i\cap U_j\to U(r).\] On triple overlaps, \[g_{ij}g_{jk}=g_{ik}.\] This is the vector-bundle cocycle condition.

Analogy with projective representations In projective representations, associativity forces a cocycle condition on phase factors: \[U(g_1)U(g_2)=\omega(g_1,g_2)U(g_1g_2).\] For bundles, consistency of patch gluing forces \[g_{ij}g_{jk}=g_{ik}.\] These are not the same mathematical object, but they express a common idea: local data must agree consistently on triple overlaps or triple products.

Sections

Let \(\pi:E\to M\) be any vector bundle. A section is a smooth map \[s:M\to E\] such that \[\pi\circ s=\mathrm{id}_M.\] This means \(s(p)\in E_p\) for every \(p\in M\).

Notation: \[\Gamma(E)=\text{space of smooth sections of }E.\] This notation is used constantly in differential geometry. For example, \[\Gamma(TM)=\text{vector fields on }M,\] \[\Gamma(T^*M)=\text{one-forms on }M,\] \[\Omega^k(M)=\Gamma(\Lambda^kT^*M)=\text{$k$-forms on }M.\]

If \(E=M\times\mathbb{R}^r\) is trivial, a section is the same thing as a smooth function \[\psi:M\to\mathbb{R}^r,\] because it has the form \[s(p)=(p,\psi(p)).\] For a nontrivial vector bundle, sections are still locally vector-valued functions, but globally they need not be expressible as maps \(M\to\mathbb{R}^r\).

How section components transform

Let \[s=e^{(i)}\psi_i=e^{(j)}\psi_j.\] Since \(e^{(j)}=e^{(i)}g_{ij}\), \[e^{(i)}\psi_i=e^{(i)}g_{ij}\psi_j.\] Therefore \[\psi_i=g_{ij}\psi_j.\]

Physics translation The section \(s\) is the global geometric object. The column vector \(\psi_i\) is only its coordinate description in patch \(U_i\). A gauge transformation changes the description, not the underlying section.

Gauge changes as changes of frame

Suppose on each patch \(U_i\) we change frame by \[e^{(i)}\mapsto \widetilde e^{(i)}=e^{(i)}h_i,\] where \[h_i:U_i\to \mathop{\mathrm{GL}}(r,\mathbb{R})\] or \(\mathop{\mathrm{GL}}(r,\mathbb{C})\).

The transition functions change by \[g_{ij}\mapsto \widetilde g_{ij}=h_i^{-1}g_{ij}h_j.\] Thus transition functions are not individually gauge-invariant. The bundle is the equivalence class of such gluing data under changes of local frames.

First Examples of Nontrivial Bundles

Real line bundles

A real line bundle over a base space \(B\) is a rank-\(1\) real vector bundle \[\pi:L\to B.\] This means that every fibre \[L_b=\pi^{-1}(b)\] is a one-dimensional real vector space, hence isomorphic to \(\mathbb{R}\). Locally, \[\pi^{-1}(U_i)\cong U_i\times\mathbb{R}.\] On overlaps, transition functions are nonzero real-valued functions \[g_{ij}:U_i\cap U_j\to \mathbb{R}^\times.\] If we choose a fibre metric and normalized local frames, then only signs remain: \[g_{ij}:U_i\cap U_j\to O(1)=\{+1,-1\}.\]

The trivial real line bundle over \(B\) is \[B\times\mathbb{R}\to B.\] A nontrivial real line bundle is locally \(U\times\mathbb{R}\) but cannot be globally written as \(B\times\mathbb{R}\) in a way compatible with the vector-space structure on fibres.

The Möbius line bundle

The Möbius strip is the simplest nontrivial real line bundle over \(S^1\).

At every point of \(S^1\), attach a copy of \(\mathbb{R}\). Locally it looks like \[U\times\mathbb{R}.\] But after going once around the circle, the fibre coordinate flips sign: \[v\mapsto -v.\]

The structure group is \[O(1)=\{+1,-1\}.\] The nontrivial transition function is the sign \(-1\).

Moral of the Möbius example Locally, the Möbius bundle is indistinguishable from the cylinder \(S^1\times\mathbb{R}\). Globally, it is twisted. This is the basic phenomenon fibre bundles are designed to capture.

A real line bundle is trivial if it admits a nowhere-zero global section. The Möbius line bundle does not: any attempted nonzero section comes back with opposite sign after going around the circle.

Complex line bundles

A complex line bundle has fibre \(\mathbb{C}\). Its structure group can be taken to be \[\mathop{\mathrm{GL}}(1,\mathbb{C})=\mathbb{C}^\times.\] With a Hermitian metric, it reduces to \[U(1).\] Thus transition functions are maps \[g_{ij}:U_i\cap U_j\to U(1).\] A complex line bundle is the natural home for wavefunctions whose phase can only be chosen locally.

Principal Bundles and Associated Vector Bundles

Vector bundles are enough for tangent bundles, cotangent bundles, and Bloch bundles. Gauge theory is cleaner if one first separates the bundle of gauge frames from the vector spaces on which matter fields live. This separation is the point of principal bundles and associated vector bundles.

Principal bundles

Let \(G\) be a topological group or Lie group. A principal \(G\)-bundle is a fibre bundle whose fibre is \(G\) itself, but with one crucial extra structure: \(G\) acts on the fibre in a way compatible with all local product descriptions.

Definition 22 (Principal \(G\)-bundle). A principal \(G\)-bundle over \(M\) consists of a fibre bundle \[\pi:P\to M\] together with a continuous, or smooth in the smooth category, right action \[P\times G\to P, \qquad (p,g)\mapsto p\cdot g,\] such that:

  1. the action preserves fibres: \[\pi(p\cdot g)=\pi(p),\]

  2. the action is free: if \(p\cdot g=p\), then \(g=e\),

  3. the action is transitive on each fibre: if \(p,q\in P_x:=\pi^{-1}(x)\), then there is a unique \(g\in G\) such that \(q=p\cdot g\),

  4. there exists an open cover \(\{U_i\}\) of \(M\) and local trivializations \[\phi_i:\pi^{-1}(U_i)\to U_i\times G\] which are compatible with the right action in the following precise sense. If \[\phi_i(p)=(x,h),\] then \[\boxed{\phi_i(p\cdot g)=(x,hg).}\]

The last condition is often summarized by saying that the trivializations are \(G\)-equivariant. Written out explicitly, the local model is \[((x,h),g)\mapsto (x,hg).\] The base point \(x\) is not moved. Only the group coordinate in the fibre is multiplied on the right.

Physical interpretation A principal bundle is a bundle of local gauge frames. A point \(p\in P_x\) is a choice of frame at the base point \(x\). Right multiplication \(p\mapsto p\cdot g\) changes the frame at the same base point. It is a vertical operation: it does not move \(x\).

Local sections and transition functions

A local section \[s_i:U_i\to P\] of a principal bundle is a local choice of gauge frame. Given a \(G\)-equivariant trivialization \(\phi_i\), the corresponding section is \[s_i(x)=\phi_i^{-1}(x,e).\] Every point \(p\in P_x\) over \(x\in U_i\) can be uniquely written as \[p=s_i(x)\cdot h\] for a unique \(h\in G\).

On an overlap \(U_i\cap U_j\), two local sections are related by a unique smooth map \[g_{ij}:U_i\cap U_j\to G\] such that \[\boxed{s_j(x)=s_i(x)\cdot g_{ij}(x).}\] These \(g_{ij}\) are the transition functions of the principal bundle. In terms of local trivializations, if \[\phi_j(p)=(x,h),\] then \[p=s_j(x)\cdot h=s_i(x)\cdot g_{ij}(x)h,\] so \[\boxed{ (\phi_i\circ\phi_j^{-1})(x,h)=(x,g_{ij}(x)h). }\] On triple overlaps, consistency gives the cocycle condition \[\boxed{g_{ij}(x)g_{jk}(x)=g_{ik}(x).}\] This is the principal-bundle version of patch-gluing consistency.

Associated vector bundles

Matter fields usually transform in a representation of the gauge group. The principal bundle contains the gauge frames; a representation tells us what vector space a matter field lives in.

Let \(P\to M\) be a principal \(G\)-bundle and let \[\rho:G\to \mathop{\mathrm{GL}}(V)\] be a representation of \(G\) on a real or complex vector space \(V\).

Definition 23 (Associated vector bundle). The associated vector bundle with fibre \(V\) is the quotient space \[E:=P\times_G V:=(P\times V)/\sim,\] where \[(p,v)\sim(p\cdot g,\rho(g^{-1})v)\] for every \(g\in G\). The equivalence class of \((p,v)\) is denoted \[[p,v].\] The projection is \[\pi_E:E\to M, \qquad \pi_E([p,v])=\pi_P(p).\]

The word quotient means that we start with all pairs \((p,v)\) and then declare pairs related by the above rule to represent the same point of \(E\). The inverse in \(\rho(g^{-1})\) is not decorative: it ensures that changing the frame and transforming the coordinate vector in the opposite way leaves the actual geometric vector unchanged.

Choose a local section \(s_i:U_i\to P\). Then every element of \(E_x\) for \(x\in U_i\) can be written as \[[s_i(x),v_i]\] for a unique \(v_i\in V\). If \(s_j=s_i\cdot g_{ij}\), then \[[s_j(x),v_j] = [s_i(x)\cdot g_{ij}(x),v_j] = [s_i(x),\rho(g_{ij}(x))v_j].\] Thus local component vectors obey \[\boxed{v_i=\rho(g_{ij})v_j.}\] This is exactly the vector-bundle transition rule.

Physics examples

  • If spacetime is \(M\) and \(P\to M\) is a principal \(U(1)\)-bundle, a charge-\(q\) complex scalar field is a section of the associated line bundle with representation \[\rho(e^{i\alpha})=e^{iq\alpha}.\] Locally it is a complex function \(\psi_i(x)\), and on overlaps \[\psi_i=e^{iq\alpha_{ij}}\psi_j.\]

  • If \(P\to M\) is a principal \(SU(N)\)-bundle and \(V=\mathbb{C}^N\) is the fundamental representation, a fundamental matter field is a section of \[P\times_{SU(N)}\mathbb{C}^N.\]

  • The adjoint bundle \[\operatorname{Ad}(P):=P\times_G\mathfrak g,\] where \(G\) acts on \(\mathfrak g\) by \(\mathop{\mathrm{Ad}}\), is where non-Abelian field strengths naturally live.

  • Spinor fields are sections of vector bundles associated to a principal \(\mathrm{Spin}(n)\)-bundle through a spinor representation.

Matter fields as sections In physics language, a matter field is not fundamentally a function \(M\to V\). It is a section of an associated vector bundle. It becomes a \(V\)-valued function only after choosing a local gauge frame.

Connections: Why Gauge Fields Exist

The central problem is simple but fundamental. A section of a vector bundle assigns a vector to each point, but those vectors live in different vector spaces. If \(s\in\Gamma(E)\) and \(p,q\in M\), then \[s(p)\in E_p, \qquad s(q)\in E_q.\] Unless the bundle has been trivialized, the expression \[s(q)-s(p)\] is not defined: it tries to subtract vectors in different vector spaces. A connection is the extra structure that makes differentiation of sections meaningful.

The conceptual move

In ordinary calculus, one defines a derivative by subtracting nearby values and taking a limit. For a vector bundle this is not legal until one has a way to compare nearby fibres. One could try to define parallel transport first and then define differentiation, but parallel transport itself needs a rule for what it means to remain parallel. The clean mathematical solution is to define the differentiating machine directly.

Connection in one sentence A connection is an operator \(\nabla\) that differentiates sections and satisfies the product rule one expects from differentiation.

This is not a trick. Once \(\nabla\) is specified, it produces covariant derivatives, parallel transport, curvature, and holonomy. In a local frame it becomes the familiar expression \[\nabla=d+A,\] where \(A\) is the gauge potential or connection one-form.

\(E\)-valued one-forms

Before defining a connection, decode the target space of \(\nabla\).

Let \(E\to M\) be a vector bundle. The tensor product bundle \[T^*M\otimes E\to M\] has fibre \[(T^*M\otimes E)_p=T_p^*M\otimes E_p.\] A section \[\eta\in\Gamma(T^*M\otimes E)\] is called an \(E\)-valued one-form. At each \(p\), it is equivalently a linear map \[\eta_p:T_pM\to E_p.\] If locally \[\eta=\alpha\otimes s,\] where \(\alpha\) is an ordinary one-form and \(s\) is a section of \(E\), then \[\eta(X)=\alpha(X)s.\] For a sum of such terms, extend linearly.

Thus if \(\nabla s\in\Gamma(T^*M\otimes E)\), then feeding in a vector field \(X\) gives a section of \(E\): \[(\nabla s)(X)\in\Gamma(E).\] We write \[\nabla_Xs:=(\nabla s)(X).\] This is also denoted \(\iota_X(\nabla s)\), meaning insertion of \(X\) into the one-form slot.

What \(df\otimes s\) means In the Leibniz rule below, \(df\otimes s\) is not saying that \(s\) is a one-form. It is an \(E\)-valued one-form: \[(df\otimes s)(X)=df(X)s=X[f]s.\] Therefore \(df\otimes s\) and \(f\nabla s\) have the same type: both are sections of \(T^*M\otimes E\).

Connection on a vector bundle: formal definition

Definition 24 (Connection on a vector bundle). A connection on a vector bundle \(E\to M\) is an \(\mathbb{R}\)-linear map \[\nabla:\Gamma(E)\to\Gamma(T^*M\otimes E)\] satisfying the Leibniz rule \[\boxed{\nabla(fs)=df\otimes s+f\nabla s}\] for every \(f\in C^\infty(M)\) and \(s\in\Gamma(E)\).

Equivalently, a connection gives a bilinear operation \[(X,s)\mapsto \nabla_Xs\] satisfying \[\nabla_{fX+gY}s=f\nabla_Xs+g\nabla_Ys,\] \[\nabla_X(fs)=X[f]s+f\nabla_Xs.\] The first identity says that \(\nabla_Xs\) is tensorial in the direction \(X\): if you multiply the direction by a function, the result is multiplied by that function. The second identity says that \(\nabla\) differentiates the section slot.

Local frames and the symbolic formula \(\nabla=d+A\)

Let \(e_1,\ldots,e_r\) be a local frame for \(E\) over \(U\subset M\). Every section over \(U\) can be written uniquely as \[s=\psi^a e_a,\] where \(\psi^a\in C^\infty(U)\) and repeated fibre indices are summed.

A connection must say how to differentiate the local frame itself. Since \(\nabla e_b\) is an \(E\)-valued one-form, it can be expanded as \[\nabla e_b=A^a{}_b\otimes e_a,\] where each \(A^a{}_b\in\Omega^1(U)\) is an ordinary one-form. The collection \[A=(A^a{}_b)\] is a matrix-valued one-form.

Now apply the Leibniz rule: \[\begin{align*} \nabla s &=\nabla(\psi^b e_b)\\ &=d\psi^b\otimes e_b+\psi^b\nabla e_b\\ &=d\psi^a\otimes e_a+\psi^b A^a{}_b\otimes e_a\\ &=\left(d\psi^a+A^a{}_b\psi^b\right)\otimes e_a. \end{align*}\] This is the precise \(T^*M\otimes E\)-valued formula. Some authors swap the tensor factors and write the same object as \[e_a\otimes \left(d\psi^a+A^a{}_b\psi^b\right).\] The meaning is the same after the canonical swap \(T^*M\otimes E\cong E\otimes T^*M\).

If we evaluate on a vector field \(X\), we get \[\boxed{ \nabla_Xs= \left(X[\psi^a]+A^a{}_b(X)\psi^b\right)e_a. }\] In matrix notation, if \(e\) is the row vector of basis sections and \(\psi\) is the column vector of components, \[s=e\psi, \qquad \nabla_Xs=e\left(X[\psi]+A(X)\psi\right).\] The shorthand \[\boxed{\nabla=d+A}\] means precisely this local formula after choosing a local frame.

Intuition for the correction term The term \(d\psi\) differentiates the component functions. The term \(A\psi\) records how the chosen local frame changes from point to point. In physics language, \(A\) is the gauge field. It is the translation rule telling us how the fibre at a nearby point is tilted relative to the fibre at the present point, as seen in the chosen frame.

Why connection one-forms are Lie-algebra-valued, and how they act on matter

Suppose the vector bundle \(E\) is associated to a principal \(G\)-bundle: \[E=P\times_G V,\] where the action of \(G\) on the model fibre \(V\) is given by a representation \[\rho:G\to \mathop{\mathrm{GL}}(V).\] The principal connection itself is locally represented by a \(\mathfrak g\)-valued one-form \[\omega_i\in \Omega^1(U_i;\mathfrak g),\] where \[\mathfrak g=T_eG.\] This means that for a tangent vector \(Y\in T_xM\), \[\omega_i(Y)\in\mathfrak g.\] At first sight this may look puzzling: an element of \(\mathfrak g\) is an infinitesimal group element, while a matter field value lies in \(V\). To let \(\omega_i(Y)\) act on a vector in \(V\), one must use the derivative of the representation.

The representation \(\rho:G\to\mathop{\mathrm{GL}}(V)\) is a smooth map between Lie groups. Its differential at the identity is \[\boxed{ d\rho_e:T_eG\to T_I\mathop{\mathrm{GL}}(V). }\] By definition, \[T_eG=\mathfrak g.\] Also, \[T_I\mathop{\mathrm{GL}}(V)\cong \mathrm{End}(V).\] Indeed, \(\mathop{\mathrm{GL}}(V)\) is an open subset of the vector space \(\mathrm{End}(V)\), because invertibility is the open condition \(\det\neq0\) in any basis. Therefore its tangent space at the identity is naturally the full vector space of endomorphisms of \(V\). We therefore get the induced Lie-algebra representation \[\boxed{ d\rho:\mathfrak g\to\mathrm{End}(V). }\] Concretely, if \(X\in\mathfrak g\) is represented by a curve \(g(t)\) in \(G\) with \[g(0)=e, \qquad \dot g(0)=X,\] then \[\boxed{ d\rho(X)=\left.\frac{d}{dt}\right|_{t=0}\rho(g(t)). }\] This derivative is an element of \(\mathrm{End}(V)\), so it acts on \(v\in V\) by ordinary linear algebra: \[d\rho(X)v\in V.\] Equivalently, \[\left.\frac{d}{dt}\right|_{t=0}\rho(g(t))v=d\rho(X)v.\]

Thus the associated vector-bundle connection has the local form \[\nabla_Ys=e\left(Y[\psi]+d\rho(\omega_i(Y))\psi\right),\] where \(e\) is the local frame induced by a local section of \(P\), and \(\psi\) is the local component vector of \(s\). If one writes \[A_i:=d\rho(\omega_i)\in\Omega^1(U_i;\mathrm{End}(V)),\] then this becomes the familiar formula \[\boxed{ \nabla_Ys=e\left(Y[\psi]+A_i(Y)\psi\right). }\]

In many physics examples the notation suppresses \(d\rho\). For instance, in the fundamental representation of \(U(N)\) on \(\mathbb{C}^N\), an element of \[\mathfrak u(N)=\{X\in M_N(\mathbb{C}):X^\dagger=-X\}\] is already an \(N\times N\) matrix acting on \(\mathbb{C}^N\). In that case \(d\rho(X)=X\) in the chosen representation, and one simply writes \(A(Y)\psi\). In a different representation, however, the same abstract \(\mathfrak g\)-valued connection form acts by the corresponding matrix \(d\rho(A(Y))\).

Do not confuse \(G\), \(\mathfrak g\), and \(\mathrm{End}(V)\) The finite gauge transformation is an element \(g\in G\). The infinitesimal connection value is an element \(A(Y)\in\mathfrak g\). A matter field vector lies in \(V\). The reason \(A(Y)\) can multiply a matter vector is that the representation differentiates to \[d\rho:\mathfrak g\to\mathrm{End}(V).\] Only after this map has been applied does the Lie-algebra element become a linear operator on the fibre.

Why ordinary derivatives do not transform correctly

Suppose on an overlap \(U_i\cap U_j\) we have two frames related by \[e^{(j)}=e^{(i)}g_{ij},\] where \(g_{ij}:U_i\cap U_j\to \mathop{\mathrm{GL}}(r,\mathbb{R})\) or \(\mathop{\mathrm{GL}}(r,\mathbb{C})\) is an ordinary smooth matrix-valued function. It is not a Grassmann variable. In path-integral physics, a fermion field may be Grassmann-valued, but the gauge transformation matrix \(g_{ij}(x)\) is still an ordinary even matrix acting on the field components.

A section has components \[s=e^{(i)}\psi_i=e^{(j)}\psi_j.\] Therefore \[\psi_i=g_{ij}\psi_j.\] Taking an ordinary derivative gives \[d\psi_i=(dg_{ij})\psi_j+g_{ij}d\psi_j.\] The extra term \((dg_{ij})\psi_j\) means that ordinary derivatives of local components do not transform like the components themselves. A connection is designed so that \[D_i\psi_i=g_{ij}D_j\psi_j.\]

Gauge transformation law for a vector-bundle connection

Write the covariant derivative in frame \(i\) as \[D_i=d+A_i\] and in frame \(j\) as \[D_j=d+A_j.\] The geometric object \(\nabla s\) is globally defined, so its components must transform like the components of \(s\): \[D_i\psi_i=g_{ij}D_j\psi_j.\] Using \(\psi_i=g_{ij}\psi_j\), \[\begin{align*} D_i\psi_i &=d(g_{ij}\psi_j)+A_i g_{ij}\psi_j\\ &=(dg_{ij})\psi_j+g_{ij}d\psi_j+A_i g_{ij}\psi_j. \end{align*}\] The right-hand side is \[g_{ij}D_j\psi_j=g_{ij}d\psi_j+g_{ij}A_j\psi_j.\] Canceling \(g_{ij}d\psi_j\) and using arbitrary \(\psi_j\) gives \[dg_{ij}+A_i g_{ij}=g_{ij}A_j.\] Multiplying by \(g_{ij}^{-1}\) on the left, \[\boxed{A_j=g_{ij}^{-1}A_i g_{ij}+g_{ij}^{-1}dg_{ij}.}\] The term \(g_{ij}^{-1}dg_{ij}\) appears because the change of frame depends on position.

What the inhomogeneous term means The extra term \(g_{ij}^{-1}dg_{ij}\) appears because the change of frame depends on position. If the frame change were constant, ordinary conjugation would be enough. Since the frame itself varies over \(M\), differentiating \(g_{ij}\psi_j\) produces an additional derivative of \(g_{ij}\).

For a \(U(1)\) line bundle, with \(g_{ij}=e^{i\chi_{ij}}\), \[g_{ij}^{-1}dg_{ij}=i\,d\chi_{ij}.\] Depending on whether one uses Hermitian or anti-Hermitian gauge potentials, this becomes the familiar physics formula \(A\mapsto A+d\chi\) up to sign and factors of \(i\).

Local gauge transformations versus transition functions

There are two closely related but distinct uses of \(G\)-valued functions.

First, transition functions \[g_{ij}:U_i\cap U_j\to G\] are part of the gluing data of a bundle. They tell us how the frame on patch \(U_j\) is compared with the frame on patch \(U_i\).

Second, a local gauge transformation is a change of local frame inside the same bundle. On a patch \(U_i\), choose \[h_i:U_i\to G.\] If the old frame is \(e^{(i)}\), the new frame may be written \[\widetilde e^{(i)}=e^{(i)}h_i.\] Then local components and gauge potentials change by \[\widetilde\psi_i=h_i^{-1}\psi_i, \qquad \widetilde A_i=h_i^{-1}A_i h_i+h_i^{-1}dh_i.\] On overlaps, the transition functions also change: \[\widetilde g_{ij}=h_i^{-1}g_{ij}h_j.\] Thus a gauge transformation is not literally the same thing as a transition function. Rather, both are \(G\)-valued functions describing changes of local frame. In QFT on flat spacetime \(\mathbb{R}^d\), one often assumes the bundle is trivial and uses one global patch; then a gauge transformation is just a smooth map \[h:M\to G.\] On a nontrivial bundle there may be no global frame, so the same idea is described by compatible local functions \(h_i:U_i\to G\).

Local does not mean non-geometric The local gauge potential \(A_i\) is defined only after choosing a local trivialization. On a nontrivial bundle, one generally cannot choose a single \(A\) on all of \(M\). The global object is the connection; the local objects \(A_i\) are its coordinate descriptions, related by gauge transformation laws on overlaps.

Connection on the tangent bundle and Christoffel symbols

A connection on \(TM\) differentiates vector fields: \[\nabla_XY\in\Gamma(TM).\] Here \(X\) is the direction of differentiation and \(Y\) is the vector field being differentiated.

Choose coordinates \(x^1,\ldots,x^n\) on \(U\subset M\). The coordinate vector fields \[\partial_1,\ldots,\partial_n, \qquad \partial_\mu:=\frac{\partial}{\partial x^\mu},\] form a local frame for \(TM\). Since \(\nabla_{\partial_\mu}\partial_\nu\) is again a vector field, it can be expanded in this frame: \[\boxed{ \nabla_{\partial_\mu}\partial_\nu = \Gamma^\lambda{}_{\mu\nu}\partial_\lambda. }\] The coefficients \(\Gamma^\lambda{}_{\mu\nu}\) are the Christoffel symbols, or connection coefficients, of this connection in this coordinate frame.

This equation should be read exactly like the vector-bundle formula \(\nabla e_b=A^a{}_b\otimes e_a\). For \(TM\), the local frame is \(e_\nu=\partial_\nu\), and \[(A_\mu)^\lambda{}_{\nu}=\Gamma^\lambda{}_{\mu\nu}.\]

If \[Y=Y^\nu\partial_\nu,\] then \[\begin{align*} \nabla_{\partial_\mu}Y &=\nabla_{\partial_\mu}(Y^\nu\partial_\nu)\\ &=(\partial_\mu Y^\nu)\partial_\nu+Y^\nu\nabla_{\partial_\mu}\partial_\nu\\ &=(\partial_\mu Y^\lambda+\Gamma^\lambda{}_{\mu\nu}Y^\nu)\partial_\lambda. \end{align*}\] Therefore \[\boxed{(\nabla_\mu Y)^\lambda=\partial_\mu Y^\lambda+\Gamma^\lambda{}_{\mu\nu}Y^\nu.}\]

Christoffel symbols are not ordinary partial derivatives of basis vectors The expression \(\partial_\mu\partial_\nu\) is not intrinsically meaningful as a derivative of a vector field on a manifold. The connection defines what it means to differentiate the basis vector field \(\partial_\nu\) in the direction \(\partial_\mu\). The result is expanded in the same basis, and the coefficients are \(\Gamma^\lambda{}_{\mu\nu}\).

Covariant derivative of covectors and tensors

A one-form \[\alpha=\alpha_\nu dx^\nu\] is a section of \(T^*M\). A connection on \(TM\) induces a connection on \(T^*M\) by requiring the natural pairing to obey the product rule: \[X[\alpha(Y)]=(\nabla_X\alpha)(Y)+\alpha(\nabla_XY).\] Set \(X=\partial_\mu\) and \(Y=\partial_\nu\). Since \(\alpha(\partial_\nu)=\alpha_\nu\), \[\begin{align*} \partial_\mu\alpha_\nu &=(\nabla_\mu\alpha)(\partial_\nu)+\alpha(\nabla_{\partial_\mu}\partial_\nu)\\ &=(\nabla_\mu\alpha)_\nu+\Gamma^\lambda{}_{\mu\nu}\alpha_\lambda. \end{align*}\] Thus \[\boxed{(\nabla_\mu\alpha)_\nu=\partial_\mu\alpha_\nu-\Gamma^\lambda{}_{\mu\nu}\alpha_\lambda.}\] The minus sign is forced by compatibility with the pairing between vectors and covectors.

For a tensor, one gets one \(+\Gamma\) term for each upper index and one \(-\Gamma\) term for each lower index. This is just repeated use of the product rule.

Torsion

The Lie bracket \([X,Y]\) measures the infinitesimal failure of the flows of \(X\) and \(Y\) to close as a coordinate parallelogram. The expression \[\nabla_XY-\nabla_YX\] measures an antisymmetrized change of vector fields using the chosen connection. These are not automatically the same.

Definition 25 (Torsion). The torsion tensor of a connection on \(TM\) is \[\boxed{T(X,Y)=\nabla_XY-\nabla_YX-[X,Y].}\]

Torsion is tensorial in \(X\) and \(Y\), so it depends only on the values \(X_p,Y_p\) at a point, not on how the vector fields are extended nearby.

In a coordinate frame, \([\partial_\mu,\partial_\nu]=0\), so \[T(\partial_\mu,\partial_\nu) = (\Gamma^\lambda{}_{\mu\nu}-\Gamma^\lambda{}_{\nu\mu})\partial_\lambda.\] Thus \[T^\lambda{}_{\mu\nu}=\Gamma^\lambda{}_{\mu\nu}-\Gamma^\lambda{}_{\nu\mu}.\] A connection is torsion-free if \[\nabla_XY-\nabla_YX=[X,Y].\] In coordinates, torsion-free means \[\Gamma^\lambda{}_{\mu\nu}=\Gamma^\lambda{}_{\nu\mu}\] for coordinate vector fields.

Intuition for torsion The bracket \([X,Y]\) is the displacement defect of the infinitesimal flow parallelogram. The term \(\nabla_XY-\nabla_YX\) is what the connection says the antisymmetric change of the directions should be. Torsion is the mismatch between the connection’s antisymmetric change and the manifold’s actual flow-commutator defect.

Levi-Civita connection

A general connection on \(TM\) is not determined by the smooth manifold alone. If \(M\) has a Riemannian metric \(g\), there is a distinguished connection.

Definition 26 (Levi-Civita connection). The Levi-Civita connection is the unique connection on \(TM\) that is:

  1. torsion-free: \[T(X,Y)=0,\] that is, \[\nabla_XY-\nabla_YX=[X,Y],\]

  2. metric-compatible: \[X[g(Y,Z)]=g(\nabla_XY,Z)+g(Y,\nabla_XZ).\]

In coordinates, its Christoffel symbols are \[\boxed{ \Gamma^\lambda{}_{\mu\nu} =\frac12 g^{\lambda\rho} \left( \partial_\mu g_{\nu\rho} +\partial_\nu g_{\mu\rho} -\partial_\rho g_{\mu\nu} \right). }\] This formula is not the definition of every connection. It is the special formula for the unique torsion-free, metric-compatible connection determined by a metric.

Christoffel symbols are not tensors The numbers \(\Gamma^\lambda{}_{\mu\nu}\) depend on the coordinate system. Under coordinate changes they transform with an inhomogeneous term, like a gauge potential. The connection is geometric; its local coefficients are coordinate-dependent.

Curvature as noncommuting covariant derivatives

The curvature of a vector-bundle connection is the obstruction to commuting covariant derivatives, corrected by the bracket of the directions.

Definition 27 (Curvature of a vector-bundle connection). For vector fields \(X,Y\), \[\boxed{ F(X,Y)s = \nabla_X\nabla_Ys - \nabla_Y\nabla_Xs - \nabla_{[X,Y]}s. }\]

Why is the final term needed? Moving along \(X\) then \(Y\) and moving along \(Y\) then \(X\) do not generally end at the same point. The endpoint mismatch is of order area and is governed by \([X,Y]\). To compare the transported vectors fairly, one must correct by the covariant derivative in the gap direction. This is exactly the role of \(-\nabla_{[X,Y]}s\).

Infinitesimal-loop intuition For a small parameter \(\epsilon\), the difference between the two ordered second covariant derivatives is area order: \[\epsilon^2(\nabla_X\nabla_Ys-\nabla_Y\nabla_Xs).\] But the two endpoint paths also differ by an area-order displacement \(\epsilon^2[X,Y]\). Pulling one endpoint back along this displacement contributes \[\epsilon^2\nabla_{[X,Y]}s.\] After subtracting this basepoint mismatch and dividing by the area \(\epsilon^2\), the remaining infinitesimal holonomy density is \[F(X,Y)s.\]

Derivation of \(F=dA+A\wedge A\)

In a local frame, write \[\nabla=d+A.\] Let \(\psi\) be the column vector of components of a section. Then \[\nabla\psi=d\psi+A\psi.\] Apply \(\nabla\) again. Here \(d\psi\) is a vector-valued one-form, and \(A\) is a matrix-valued one-form. Using the graded product rule, \[d(A\psi)=dA\,\psi-A\wedge d\psi.\] Therefore \[\begin{align*} \nabla^2\psi &=(d+A)(d\psi+A\psi)\\ &=d^2\psi+d(A\psi)+A\wedge d\psi+A\wedge A\psi\\ &=0+(dA\,\psi-A\wedge d\psi)+A\wedge d\psi+A\wedge A\psi\\ &=(dA+A\wedge A)\psi. \end{align*}\] Thus \[\boxed{F=dA+A\wedge A.}\] For matrix-valued forms, \[(A\wedge A)^a{}_b=A^a{}_c\wedge A^c{}_b.\]

Write \[A=A_\mu dx^\mu.\] Then \[dA=(\partial_\mu A_\nu)dx^\mu\wedge dx^\nu =\frac12(\partial_\mu A_\nu-\partial_\nu A_\mu)dx^\mu\wedge dx^\nu.\] Also \[A\wedge A =\frac12[A_\mu,A_\nu]dx^\mu\wedge dx^\nu.\] Therefore \[\boxed{ F=\frac12F_{\mu\nu}dx^\mu\wedge dx^\nu, \qquad F_{\mu\nu}=\partial_\mu A_\nu-\partial_\nu A_\mu+[A_\mu,A_\nu]. }\] For an Abelian \(U(1)\) connection, the commutator vanishes, so \(F=dA\).

Riemann curvature tensor

For a connection on \(TM\), the curvature acts on vector fields: \[R(X,Y)Z = \nabla_X\nabla_YZ- \nabla_Y\nabla_XZ- \nabla_{[X,Y]}Z.\] This is the Riemann curvature tensor of the connection. In coordinates, \[R(\partial_\mu,\partial_\nu)\partial_\sigma = R^\rho{}_{\sigma\mu\nu}\partial_\rho.\] Using \(A_\mu{}^\rho{}_{\sigma}=\Gamma^\rho{}_{\mu\sigma}\) in the formula for \(F_{\mu\nu}\) gives \[\boxed{ R^\rho{}_{\sigma\mu\nu} = \partial_\mu\Gamma^\rho{}_{\nu\sigma} - \partial_\nu\Gamma^\rho{}_{\mu\sigma} + \Gamma^\rho{}_{\mu\lambda}\Gamma^\lambda{}_{\nu\sigma} - \Gamma^\rho{}_{\nu\lambda}\Gamma^\lambda{}_{\mu\sigma}. }\] For the Levi-Civita connection, this is the usual Riemann curvature tensor of Riemannian geometry.

How curvature transforms under gauge transformations

Under a frame change, \[A_j=g_{ij}^{-1}A_i g_{ij}+g_{ij}^{-1}dg_{ij}.\] A direct computation using \(d(g^{-1})=-g^{-1}(dg)g^{-1}\) gives \[\boxed{F_j=g_{ij}^{-1}F_i g_{ij}.}\] Thus the connection one-form \(A\) transforms inhomogeneously, but the curvature transforms covariantly.

For \(U(1)\), conjugation is trivial, so \[F_j=F_i.\] The local curvatures glue to a globally defined ordinary two-form \(F\). The potentials \(A_i\) may not glue to one global one-form.

Parallel transport and holonomy

Let \[\gamma:[0,1]\to M\] be a path. A section of \(E\) along \(\gamma\) is a map \[v(t)\in E_{\gamma(t)}.\] More formally, it is a section of the pullback bundle \(\gamma^*E\to[0,1]\).

In a local frame along the path, write \(v(t)\) as a component vector. The covariant derivative along the path is \[\boxed{ D_t v:=\nabla_{\dot\gamma(t)}v = \frac{dv}{dt}+A_{\gamma(t)}(\dot\gamma(t))v. }\] Here inserting \(\dot\gamma(t)\) into the one-form \(A\) means \[A_{\gamma(t)}(\dot\gamma(t)) = A_\mu(\gamma(t))\dot\gamma^\mu(t).\] A vector is parallel along \(\gamma\) if \[D_t v=0.\] Thus parallel transport is the solution of the ordinary differential equation \[\frac{dv}{dt}=-A_\mu(\gamma(t))\dot\gamma^\mu(t)v.\]

The formal solution is \[v(1)=\mathcal P\exp\left(-\int_\gamma A\right)v(0).\] The path-ordering symbol \(\mathcal P\) is needed because matrices \(A_\mu(\gamma(t_1))\) and \(A_\nu(\gamma(t_2))\) may not commute at different times.

A precise definition is \[\boxed{ \mathcal P\exp\left(-\int_\gamma A\right) := \lim_{N\to\infty} \prod_{n=N}^{1} \left(I-A_{\gamma(t_n)}(\dot\gamma(t_n))\Delta t\right), }\] where \(0=t_0<t_1<\cdots<t_N=1\) and \(\Delta t=1/N\). The factor with larger time stands to the left because it acts later on the vector.

If \(\gamma\) is a closed loop, the resulting linear map from \(E_{\gamma(0)}\) to itself is called holonomy: \[\mathop{\mathrm{Hol}}_\gamma(A)=\mathcal P\exp\left(-\oint_\gamma A\right).\] For \(U(1)\), path ordering is unnecessary, and holonomy is simply a phase, up to convention: \[\mathop{\mathrm{Hol}}_\gamma(A)=\exp\left(-\oint_\gamma A\right).\] This is the bundle language for Berry phase and Aharonov-Bohm phase.

Infinitesimal holonomy and curvature

Let us derive the standard small-loop formula. Work in local coordinates and consider the small rectangle based at \(x\) with sides \(dx^\mu\) and \(dx^\nu\). Parallel transport in the \(+\mu\) direction contributes \[I-A_\mu(x)dx^\mu.\] Transport in the \(+\nu\) direction at the shifted point contributes \[I-\left(A_\nu(x)+\partial_\mu A_\nu(x)dx^\mu\right)dx^\nu.\] Transport back in the \(-\mu\) direction contributes \[I+\left(A_\mu(x)+\partial_\nu A_\mu(x)dx^\nu\right)dx^\mu.\] Transport back in the \(-\nu\) direction contributes \[I+A_\nu(x)dx^\nu.\] Multiplying these four factors in the order in which they act and keeping only terms of order \(dx^\mu dx^\nu\) gives \[\mathop{\mathrm{Hol}}_{\square} = I- \left( \partial_\mu A_\nu- \partial_\nu A_\mu+[A_\mu,A_\nu] \right)dx^\mu dx^\nu +O(|dx|^3).\] Therefore \[\boxed{\mathop{\mathrm{Hol}}_{\square}\approx I-F_{\mu\nu}dx^\mu dx^\nu.}\] Curvature is the infinitesimal holonomy per unit area.

Curvature as infinitesimal holonomy Curvature is the infinitesimal version of holonomy. Parallel transport around a very small loop spanning area elements \(dx^\mu dx^\nu\) gives \[\mathop{\mathrm{Hol}}_\square\approx I-F_{\mu\nu}dx^\mu dx^\nu.\] Thus \(F\) measures how much a vector fails to come back to itself after parallel transport around an infinitesimal loop.

Principal connections

Let \(P\to M\) be a principal \(G\)-bundle with Lie algebra \(\mathfrak g\). At \(p\in P\), the vertical subspace is \[V_pP:=\ker(d\pi_p)\subset T_pP.\] It consists of tangent directions that move only along the fibre. A principal connection can be described as a smooth choice of horizontal subspaces \[T_pP=H_pP\oplus V_pP\] compatible with the right \(G\)-action.

Equivalently, a principal connection is a \(\mathfrak g\)-valued one-form \[\omega\in\Omega^1(P,\mathfrak g)\] satisfying:

  1. on vertical vectors generated by \(X\in\mathfrak g\), it returns \(X\),

  2. under the right action \(R_g:P\to P\), it transforms as \[R_g^*\omega=\mathop{\mathrm{Ad}}_{g^{-1}}\omega.\]

A local gauge field is obtained by choosing a local section \(s_i:U_i\to P\) and pulling back: \[A_i=s_i^*\omega\in\Omega^1(U_i,\mathfrak g).\] Thus the physics object \(A_i=A_{i,\mu}dx^\mu\) is the local expression of the global principal connection \(\omega\).

Geometric meaning A principal connection tells us which tangent directions in the total space \(P\) count as horizontal, meaning genuine motion along the base \(M\) rather than pure motion along the gauge fibre. Parallel transport is horizontal lifting of paths from \(M\) to \(P\).

Bianchi identity

The curvature satisfies the Bianchi identity \[d_AF=0,\] where \[d_AF=dF+A\wedge F-F\wedge A.\] For \(U(1)\), this reduces to \[dF=0.\] In electromagnetism, \(dF=0\) contains the homogeneous Maxwell equations.

Lie derivative versus connection

Now the difference can be stated cleanly.

Operation Meaning
\(\mathcal{L}_X\) Differentiates tensor fields on \(M\) by dragging them along the flow of \(X\). It is canonical and needs no connection.
\(\nabla_X\) Differentiates sections by comparing nearby fibres using a chosen connection. It depends on extra geometric data.

For functions, \[\mathcal{L}_Xf=X[f]=\nabla_Xf.\] For vector fields, if \(\nabla\) is torsion-free, \[\mathcal{L}_XY=[X,Y]=\nabla_XY-\nabla_YX.\] Thus \(\mathcal{L}_XY\) is not simply \(\nabla_XY\). For sections of an arbitrary associated vector bundle, there is no canonical Lie derivative unless the base-space flow is lifted to the bundle. A connection, however, always gives \(\nabla_Xs\).

Characteristic Classes and the First Chern Number

We have now built the geometric objects needed for Chern numbers: \[\text{vector bundle} \quad + \quad \text{connection} \quad \Longrightarrow \quad \text{curvature}.\] Characteristic classes are topological invariants extracted from this data. The first Chern class is the most important example for complex line bundles and two-dimensional band topology.

The problem characteristic classes solve

A bundle may be locally trivial but globally twisted. The twisting can be described in two different languages:

  1. transition functions on overlaps,

  2. curvature of a connection.

The first language is visibly topological: transition functions remember how patches are glued. The second language is differential-geometric: curvature measures infinitesimal holonomy. Characteristic classes explain why these two languages give the same global invariants.

Two languages for the same topology \[\begin{align*} \text{Transition functions} &\quad \text{record topology by winding and cocycle data},\\ \text{Curvature forms} &\quad \text{record topology by closed forms and quantized integrals}. \end{align*}\] The first Chern class is the bridge between these two descriptions.

Core idea A characteristic class is a cohomology class naturally associated to a vector bundle. It does not depend on the particular connection used to compute it, although curvature gives convenient differential-form representatives of it.

From local potentials to global curvature

Let \(L\to M\) be a complex line bundle with a unitary connection. Choose local frames on patches \(U_i\). In each patch the connection is represented by a local one-form \[A_i\in\Omega^1(U_i;i\mathbb{R}),\] where we use the mathematical anti-Hermitian convention. On overlaps, \[A_j=A_i+g_{ij}^{-1}dg_{ij}\] for \(g_{ij}:U_i\cap U_j\to U(1)\).

The curvature is \[F_i=dA_i.\] Since \(U(1)\) is Abelian, \[d(g_{ij}^{-1}dg_{ij})=0,\] so \[F_j=F_i.\] Thus the local two-forms \(F_i\) glue to a globally defined two-form \[F\in\Omega^2(M;i\mathbb{R}).\] This is the first important point: the potential \(A_i\) may only exist locally, but the curvature \(F\) is global.

Important point Even when the gauge potential \(A\) cannot be chosen globally, the curvature \(F\) may be globally defined. A nonzero Chern number is precisely a signal that the local potentials \(A_i\) cannot be patched into one smooth global potential.

If \(A\) were a globally defined one-form on a closed oriented surface \(\Sigma\), then Stokes’ theorem would give \[\int_\Sigma F = \int_\Sigma dA = \int_{\partial\Sigma}A =0,\] because \(\partial\Sigma=\varnothing\). Therefore a nonzero integral \[\int_\Sigma F\neq 0\] forces the gauge potential to be only locally defined. This is the first geometric meaning of a nontrivial bundle.

Local potential versus global connection Saying that \(A\) is local does not mean that the physics is nonlocal. It means that \(A_i\) is the coordinate expression of a global connection after choosing a local frame. On a nontrivial line bundle there may be no single global potential \(A\) satisfying \(F=dA\) everywhere. The obstruction to finding such a global \(A\) is exactly what the first Chern class measures.

Closed forms, exact forms, and de Rham cohomology

The curvature of a connection satisfies the Bianchi identity. For a line bundle this is simply \[dF=0.\] So \(F\) is a closed two-form.

Definition 28 (Closed and exact forms). A differential form \(\omega\in\Omega^k(M)\) is closed if \[d\omega=0.\] It is exact if there exists \(\eta\in\Omega^{k-1}(M)\) such that \[\omega=d\eta.\]

Every exact form is closed because \(d^2=0\). The converse need not hold. De Rham cohomology measures the failure of closed forms to be exact.

Definition 29 (De Rham cohomology). The \(k\)-th de Rham cohomology group is \[H^k_{\mathrm{dR}}(M) = \frac{\{\omega\in\Omega^k(M):d\omega=0\}} {\{d\eta:\eta\in\Omega^{k-1}(M)\}}.\] The cohomology class of a closed form \(\omega\) is denoted \([\omega]_{\mathrm{dR}}\).

Why does cohomology appear here? Because curvature is closed, and changing a connection changes the curvature by an exact form. Therefore the cohomology class of the curvature is independent of the connection.

Indeed, if \(\nabla^0\) and \(\nabla^1\) are two connections on the same line bundle, then locally their connection forms differ by a globally defined \(i\mathbb{R}\)-valued one-form \(\alpha\): \[A^1_i-A^0_i=\alpha|_{U_i}.\] Therefore \[F^1-F^0=d\alpha.\] So \[[F^1]_{\mathrm{dR}}=[F^0]_{\mathrm{dR}}.\]

Integral cohomology and quantization

De Rham cohomology uses real coefficients. But line bundles are classified by integral data. For a complex line bundle, the relevant topological invariant is \[c_1(L)\in H^2(M;\mathbb{Z}),\] the first Chern class.

There is a natural map from integral cohomology to real de Rham cohomology, \[H^2(M;\mathbb{Z})\to H^2_{\mathrm{dR}}(M),\] which forgets the integrality but remembers the corresponding real cohomology class. A de Rham class is called integral if it lies in the image of this map.

For a unitary connection on a line bundle, the normalized curvature \[\frac{F}{2\pi i}\] is a real closed two-form. Its de Rham cohomology class is the image of the integral class \(c_1(L)\): \[\boxed{ \left[\frac{F}{2\pi i}\right]_{\mathrm{dR}} = \text{image of }c_1(L)\in H^2(M;\mathbb{Z}). }\] Consequently, for every closed oriented surface \(\Sigma\subset M\), \[\boxed{ \int_\Sigma \frac{F}{2\pi i} = \langle c_1(L),[\Sigma]\rangle \in\mathbb{Z}. }\] This is the mathematical origin of flux quantization and Chern-number quantization.

Why \(c_1(L)\) is integral: transition-function proof

Here is the concrete proof of integrality.

Let \(\{U_i\}\) be a good open cover of \(M\), meaning all nonempty finite intersections are contractible. Let \[g_{ij}:U_i\cap U_j\to U(1)\] be the transition functions of a complex line bundle \(L\). They satisfy \[g_{ij}g_{jk}g_{ki}=1\] on triple overlaps \(U_i\cap U_j\cap U_k\).

Because \(U_i\cap U_j\) is contractible, choose real-valued functions \(\chi_{ij}\) such that \[g_{ij}=e^{i\chi_{ij}}.\] On a triple overlap, \[e^{i(\chi_{ij}+\chi_{jk}+\chi_{ki})}=1.\] Therefore \[\chi_{ij}+\chi_{jk}+\chi_{ki}=2\pi n_{ijk}\] for some integer-valued locally constant function \[n_{ijk}:U_i\cap U_j\cap U_k\to\mathbb{Z}.\] On a good cover, locally constant means constant on each connected component. The collection \(\{n_{ijk}\}\) satisfies the Cech two-cocycle condition with integer coefficients. Its cohomology class \[[n_{ijk}]\in H^2(M;\mathbb{Z})\] is, by definition, the first Chern class: \[\boxed{c_1(L)=[n_{ijk}]\in H^2(M;\mathbb{Z}).}\] This proves that \(c_1(L)\) is integral because it is constructed from integer-valued cocycle data.

Changing the choices of logarithms \(\chi_{ij}\) changes \(n_{ijk}\) by an integer Cech coboundary. Changing local frames changes the transition functions by a Cech coboundary. Therefore the cohomology class \([n_{ijk}]\) is well-defined even though the representatives are not.

Curvature representative of the same integer class

Now connect the integer Cech class to curvature. Locally choose connection one-forms \(A_i\in\Omega^1(U_i;i\mathbb{R})\) satisfying \[A_j=A_i+g_{ij}^{-1}dg_{ij}.\] Since \(g_{ij}=e^{i\chi_{ij}}\), \[g_{ij}^{-1}dg_{ij}=i\,d\chi_{ij}.\] Hence \[A_j-A_i=i\,d\chi_{ij}.\] Taking exterior derivatives gives \[dA_j=dA_i,\] so \(F=dA_i\) is global. The de Rham theorem and the Cech-de Rham comparison imply that \[\left[\frac{F}{2\pi i}\right]_{\mathrm{dR}}\] is the real image of the integer Cech class \([n_{ijk}]\). Thus \(F/(2\pi i)\) is the differential-form representative of \(c_1(L)\).

For a closed oriented surface \(\Sigma\), this gives \[\int_\Sigma\frac{F}{2\pi i}\in\mathbb{Z}.\] The integer is the first Chern number of \(L\) over \(\Sigma\).

First Chern class and first Chern number

For a complex line bundle \(L\to M\) with unitary connection curvature \(F\), \[\boxed{ c_1(L) \quad\text{is represented in de Rham cohomology by}\quad \frac{F}{2\pi i}. }\] If \(\Sigma\) is a closed oriented two-dimensional manifold and \(L\to\Sigma\), the first Chern number is \[\boxed{ C_1(L)=\int_\Sigma\frac{F}{2\pi i}\in\mathbb{Z}. }\] In physics conventions one often uses a real Berry curvature \(\mathcal F\) instead of an anti-Hermitian curvature \(F\). Then the same formula is usually written \[C_1=\frac{1}{2\pi}\int_\Sigma \mathcal F.\] The two formulas differ only by the convention \(F=i\mathcal F\) or \(F=-i\mathcal F\).

Physical meaning of the first Chern number

The first Chern number measures the total twisting of a complex line bundle over a closed surface. In different physical contexts it appears as:

  • magnetic flux in units of \(2\pi\),

  • monopole charge,

  • Berry curvature flux through parameter space,

  • Chern number of a two-dimensional band,

  • integer quantum Hall conductance in units of \(e^2/h\) for a filled band.

The integrality is not an approximation. It is a topological statement: smooth deformations of the bundle or connection cannot change the integer unless the bundle itself changes, which in band theory usually requires a gap closing.

Holonomy, curvature, and Stokes’ theorem

If \(A\) is globally defined on a disk \(D\) with boundary \(\partial D\), then Stokes’ theorem gives \[\oint_{\partial D}A=\int_D dA=\int_DF.\] This is the local relation between holonomy and curvature.

On a nontrivial bundle, \(A\) may not be globally defined on a closed surface. One must use patches. The failure of the patch potentials to glue into one global potential is exactly what allows \[\int_\Sigma \frac{F}{2\pi i}\] to be a nonzero integer.

Winding number of a transition function

The Dirac monopole example uses a transition function on the equator, \[g:S^1\to U(1).\] The integer carried by such a map is called its winding number. Since this integer later becomes a Chern number, we spell out the definition carefully.

Parametrize the domain circle by \(\phi\in[0,2\pi]\) with endpoints identified. A smooth map \(g:S^1\to U(1)\) can be written locally as \[g(e^{i\phi})=e^{i\theta(\phi)}.\] The phase \(\theta\) need not be a single-valued function on the circle, but on the interval \([0,2\pi]\) we can choose a continuous lift \(\theta:[0,2\pi]\to\mathbb{R}\) satisfying \[g(e^{i\phi})=e^{i\theta(\phi)}.\] Because \(e^{i0}=e^{i2\pi}\) represents the same point of the domain circle, we must have \[e^{i\theta(2\pi)}=e^{i\theta(0)}.\] Therefore \[\theta(2\pi)-\theta(0)=2\pi n\] for a unique integer \(n\in\mathbb{Z}\).

Definition 30 (Winding number). The winding number of \(g:S^1\to U(1)\) is \[\operatorname{wind}(g):=\frac{\theta(2\pi)-\theta(0)}{2\pi}\in\mathbb{Z}.\] Equivalently, \[\boxed{ \operatorname{wind}(g)=\frac{1}{2\pi i}\int_{S^1}g^{-1}dg. }\]

The second formula is often the most useful in bundle theory. To verify it, write \(g=e^{i\theta}\) on the interval. Then \[g^{-1}dg=e^{-i\theta}d(e^{i\theta})=i\,d\theta,\] so \[\frac{1}{2\pi i}\int_{S^1}g^{-1}dg = \frac{1}{2\pi}\int_0^{2\pi}\frac{d\theta}{d\phi}\,d\phi = \frac{\theta(2\pi)-\theta(0)}{2\pi}.\]

Theorem 2 (Basic properties of winding number). Let \(g,h:S^1\to U(1)\) be smooth maps.

  1. \(\operatorname{wind}(gh)=\operatorname{wind}(g)+\operatorname{wind}(h)\).

  2. \(\operatorname{wind}(g^{-1})=-\operatorname{wind}(g)\).

  3. If \(g\) and \(h\) are homotopic as maps \(S^1\to U(1)\), then \(\operatorname{wind}(g)=\operatorname{wind}(h)\).

  4. Every integer occurs: \(g_n(e^{i\phi})=e^{in\phi}\) has \(\operatorname{wind}(g_n)=n\).

Consequently, \[[S^1,U(1)]\cong\pi_1(U(1))\cong\mathbb{Z},\] where the isomorphism sends a homotopy class to its winding number.

The homotopy invariance is important. The integral formula varies continuously under a smooth homotopy, but it is integer-valued. A continuous path in \(\mathbb{Z}\) is constant. Thus winding number cannot change under smooth deformation; it changes only if the map itself becomes ill-defined.

Why winding enters the first Chern number A complex line bundle over \(S^2\) can be described by gluing the northern and southern trivial bundles along the equator. The gluing function is a map \[g:S^1\to U(1).\] Its winding number records how many times the fibre phase rotates as one goes once around the equator. The first Chern number is the same integer, expressed in curvature language.

Dirac monopole as a principal \(U(1)\)-bundle over \(S^2\)

The Dirac monopole is the cleanest example where all the bundle language becomes concrete. The base space is \[B=S^2,\] the two-sphere surrounding the monopole. This \(S^2\) is not the whole physical space; it is a closed surface enclosing the monopole. The magnetic field through this sphere is encoded as curvature of a connection on a principal \(U(1)\)-bundle.

The principal bundle

For each point \(x\in S^2\), attach a copy of the gauge group \(U(1)\). The resulting principal bundle is \[\pi:P_n\to S^2, \qquad \pi^{-1}(x)\cong U(1).\] The subscript \(n\) denotes the integer Chern number. The fibre is a copy of \(U(1)\), but more precisely it is a \(U(1)\)-torsor: there is no preferred identity element in a fibre until one chooses a local gauge frame.

Cover \(S^2\) by northern and southern patches, \[U_N=S^2\setminus\{\text{south pole}\}, \qquad U_S=S^2\setminus\{\text{north pole}\}.\] Choose local sections \[s_N:U_N\to P_n, \qquad s_S:U_S\to P_n.\] On the overlap \(U_N\cap U_S\), which deformation retracts to the equator, suppose \[\boxed{s_N(x)=s_S(x)\cdot g_{SN}(x)}\] with \[g_{SN}:U_N\cap U_S\to U(1).\] Restricting to the equator and using the angular coordinate \(\phi\), take \[\boxed{g_{SN}(\phi)=e^{in\phi}.}\] Then \[\operatorname{wind}(g_{SN})=n.\] This integer labels the principal \(U(1)\)-bundle. For \(n=0\), the bundle is the trivial product \(S^2\times U(1)\). For \(n=1\), the total space is \(S^3\), and the projection \[S^3\to S^2\] is the Hopf fibration. Thus the Hopf fibration is the unit magnetic monopole bundle.

What is attached to \(S^2\)? For the principal bundle, the fibre over each point of \(S^2\) is the gauge-frame space \(U(1)\). This is not yet the electron wavefunction space. The wavefunction lives in an associated complex line bundle, constructed from a representation of \(U(1)\).

The connection and the local gauge potentials

A principal connection is a global one-form \[\omega\in\Omega^1(P_n;i\mathbb{R})\] on the total space \(P_n\). Local gauge potentials are obtained by pulling back \(\omega\) along local sections: \[A_N:=s_N^*\omega\in\Omega^1(U_N;i\mathbb{R}), \qquad A_S:=s_S^*\omega\in\Omega^1(U_S;i\mathbb{R}).\] Using \(s_N=s_S\cdot g_{SN}\) and the principal-connection transformation law gives, because \(U(1)\) is Abelian, \[\boxed{A_N=A_S+g_{SN}^{-1}dg_{SN}.}\] For \[g_{SN}(\phi)=e^{in\phi},\] we have \[g_{SN}^{-1}dg_{SN}=in\,d\phi.\] Thus the local potentials differ by an exact-looking term on the overlap, but the phase function \(n\phi\) is not single-valued on the equator unless \(n=0\).

The curvature of the principal connection is a global two-form on \(S^2\) after pullback to local patches: \[F|_{U_N}=dA_N, \qquad F|_{U_S}=dA_S.\] The equality on the overlap follows from \[d(A_N-A_S)=d(g_{SN}^{-1}dg_{SN})=0.\] Thus \(A_N\) and \(A_S\) are local, but \(F\) is global.

Patchwise Stokes theorem: Chern number equals winding number

Choose the orientation of the equator to be the boundary orientation of \(U_N\). Then the boundary orientation of \(U_S\) is the opposite. Applying Stokes’ theorem patch by patch gives \[\begin{align*} \int_{S^2}F &= \int_{U_N}dA_N+ \int_{U_S}dA_S \\ &= \int_{\partial U_N}A_N+ \int_{\partial U_S}A_S \\ &= \int_{S^1}A_N- \int_{S^1}A_S \\ &= \int_{S^1}(A_N-A_S) \\ &= \int_{S^1}g_{SN}^{-1}dg_{SN}. \end{align*}\] Therefore \[\boxed{ \int_{S^2}\frac{F}{2\pi i} = \frac{1}{2\pi i}\int_{S^1}g_{SN}^{-1}dg_{SN} = \operatorname{wind}(g_{SN}) =n. }\] This is the precise sense in which monopole charge, transition-function winding, and first Chern number are the same integer.

Sign convention If one instead writes \(s_S=s_N\cdot g_{NS}\), then \(g_{NS}=g_{SN}^{-1}\) and the winding number changes sign. The invariant statement is not the name \(g_{SN}\) versus \(g_{NS}\); the invariant statement is that, once the transition-function and orientation conventions are fixed, \[\int_{S^2}\frac{F}{2\pi i}\] equals the corresponding transition-function winding number.

Associated line bundle and charged wavefunctions

Quantum mechanics needs wavefunctions. A charged wavefunction is not a section of the principal bundle \(P_n\) itself. It is a section of a complex line bundle associated to \(P_n\).

Let \[\rho_q:U(1)\to\mathop{\mathrm{GL}}(\mathbb{C})\] be a one-dimensional representation. Mathematically, continuous representations of \(U(1)\) on \(\mathbb{C}\) have integer weight. With the convention \[\rho_q(e^{i\alpha})=e^{-iq\alpha}, \qquad q\in\mathbb{Z},\] the associated line bundle is \[\boxed{L_q=P_n\times_{U(1)}\mathbb{C}.}\] By definition, \[L_q=(P_n\times\mathbb{C})/\sim,\] where \[(p,z)\sim(p\cdot h,\rho_q(h^{-1})z), \qquad h\in U(1).\] A charged wavefunction is a section \[\psi\in\Gamma(L_q).\] After choosing a local section \(s_N\) or \(s_S\), the same global wavefunction is represented by local complex functions \[\psi_N:U_N\to\mathbb{C}, \qquad \psi_S:U_S\to\mathbb{C}.\] On the overlap, since \(s_N=s_S\cdot g_{SN}\), the local components transform by the representation: \[\psi_S=\rho_q(g_{SN})\psi_N =e^{-iqn\phi}\psi_N.\] This is the ordinary phase transformation of a charge-\(q\) wavefunction. The point is that the phase change is not an optional decoration; it is forced by the associated-bundle construction.

Physical interpretation The base space is \(S^2\). The principal bundle attaches gauge frames \(U(1)\) to each point. The associated line bundle attaches the internal charge space \(\mathbb{C}\) to each point. The electron’s scalar wavefunction is a section of this associated complex line bundle, not a spin-space object and not an ordinary globally defined function unless the line bundle is trivial.

The connection on \(P_n\) induces a connection on \(L_q\). In a local gauge, it has the familiar form \[D\psi=d\psi-iq\,\mathcal A\,\psi\] if one writes the physical real gauge potential as \(\mathcal A\) rather than the anti-Hermitian form \(A=i\mathcal A\). This is the standard minimal coupling rule. In bundle language it is simply the connection on the associated line bundle.

The usual Dirac monopole potentials

In the physics convention with real gauge potentials, one often writes \[\mathcal A_N=\frac{n}{2}(1-\cos\theta)d\phi, \qquad \mathcal A_S=-\frac{n}{2}(1+\cos\theta)d\phi.\] Then \[\mathcal A_N-\mathcal A_S=n\,d\phi,\] and \[\mathcal F=d\mathcal A_N=d\mathcal A_S=\frac{n}{2}\sin\theta\,d\theta\wedge d\phi.\] Therefore \[\frac{1}{2\pi}\int_{S^2}\mathcal F = \frac{1}{2\pi}\int_0^{2\pi}\int_0^\pi \frac{n}{2}\sin\theta\,d\theta\,d\phi =n.\] This is the same computation as above, written in the physicist’s real-curvature convention.

Homotopy, cohomology, and curvature in one example For the monopole bundle over \(S^2\), the following data all determine the same integer: \[\begin{align*} \text{homotopy:}&& [g_{SN}]&\in[S^1,U(1)]\cong\mathbb{Z},\\ \text{winding:}&& \operatorname{wind}(g_{SN})&=n,\\ \text{cohomology:}&& c_1(L)&\in H^2(S^2;\mathbb{Z})\cong\mathbb{Z},\\ \text{curvature:}&& \int_{S^2}\frac{F}{2\pi i}&=n. \end{align*}\] These are not four different phenomena. They are four languages for the same topological twisting.

The same example from homotopy

For \(S^2\) covered by two disks, the overlap deformation retracts to the equator \(S^1\). A complex line bundle is therefore specified by a gluing map \[g:S^1\to U(1).\] Homotopy classes of such maps are classified by \[\pi_1(U(1))\cong\mathbb{Z}.\] The integer is the winding number. The first Chern class packages this same integer as an element of \[H^2(S^2;\mathbb{Z})\cong\mathbb{Z}.\] The curvature formula packages it as \[\int_{S^2}\frac{F}{2\pi i}\in\mathbb{Z}.\] Thus homotopy, cohomology, transition functions, and curvature are four languages for the same topological information in this example.

Higher-rank bundles and characteristic classes

For a rank-\(r\) complex vector bundle \(E\to M\) with connection curvature \(F\), the total Chern class is formally represented by \[c(E)=\det\left(I+\frac{F}{2\pi i}\right) =1+c_1(E)+c_2(E)+\cdots.\] The first Chern class is represented by \[c_1(E)=\left[\frac{\mathrm{Tr}F}{2\pi i}\right].\] The Chern character is \[\operatorname{ch}(E)=\mathrm{Tr}\exp\left(\frac{F}{2\pi i}\right).\] For real vector bundles, other characteristic classes appear, especially Stiefel-Whitney classes \[w_i(E)\in H^i(M;\mathbb{Z}_2)\] and Pontryagin classes \[p_i(E)\in H^{4i}(M;\mathbb{Z}).\] For example, \[w_1(E)=0\] is the obstruction to orientability, and \[w_2(TM)=0\] is the obstruction to the existence of a spin structure on an oriented Riemannian manifold.

Minimal algebraic topology needed here

For the main text, the following operational facts are enough:

  • \(\pi_1(U(1))\cong\mathbb{Z}\): loops in \(U(1)\) have an integer winding number.

  • \(H^2(S^2;\mathbb{Z})\cong\mathbb{Z}\): line bundles over \(S^2\) are classified by an integer.

  • \(H^2(T^2;\mathbb{Z})\cong\mathbb{Z}\): line bundles over a two-torus can have an integer Chern number.

  • Curvature gives a de Rham representative of an integral cohomology class.

The appendix develops these facts from homotopy through cohomology.

Bloch Bundles and Chern Insulators

The previous sections developed bundles in abstract language. Bloch bundles are where the same language becomes directly physical in band theory.

Why the Brillouin zone is a torus

For a two-dimensional Chern insulator, the base space of the Bloch bundle is typically \(T^2\). This statement belongs here, after vector bundles have been defined. It is not a statement about real space. It is a statement about momentum space.

In a crystal, Bloch momentum \(k\) is defined only modulo reciprocal lattice vectors. In one dimension, \[k\sim k+G.\] Therefore the one-dimensional Brillouin zone is a circle: \[\mathrm{BZ}\cong S^1.\] In two dimensions, \[(k_x,k_y)\sim(k_x+G_x,k_y), \qquad (k_x,k_y)\sim(k_x,k_y+G_y).\] So the Brillouin zone is a two-torus: \[\mathrm{BZ}\cong S^1\times S^1=T^2.\]

Why this example is delayed The torus \(T^2\) appears as the base space only when we discuss two-dimensional band topology. It is not needed for the initial manifold review. It belongs here, after fibre bundles and vector bundles have already been defined.

Occupied states form a vector bundle

Let \[H(k)\] be a Bloch Hamiltonian depending smoothly on \(k\in\mathrm{BZ}\). Suppose there is an energy gap separating occupied bands from unoccupied bands. Let \(N_{\mathrm{occ}}\) be the number of occupied bands.

At each momentum \(k\), define \[E_k:=\operatorname{span}_{\mathbb{C}}\{\text{occupied eigenstates of }H(k)\}\subset \mathcal{H}_k.\] Here \(\mathcal{H}_k\) denotes the single-particle Hilbert space at momentum \(k\). In a finite-band tight-binding model one usually identifies all \(\mathcal{H}_k\) with one fixed finite-dimensional Hilbert space \(\mathcal{H}\cong\mathbb{C}^N\), so the ambient Hilbert bundle is trivial: \[\mathrm{BZ}\times\mathcal{H}\to\mathrm{BZ}.\] The occupied subspaces \(E_k\subset\mathcal{H}\) then define a subbundle \[E=\bigsqcup_{k\in\mathrm{BZ}}E_k\subset\mathrm{BZ}\times\mathcal{H}.\] Since \[\dim_\mathbb{C}E_k=N_{\mathrm{occ}}\] for all \(k\), this is a rank-\(N_{\mathrm{occ}}\) complex vector bundle over the Brillouin zone: \[\boxed{E\to\mathrm{BZ}.}\] This is the Bloch bundle.

Equivalently, let \[\mathcal P(k):\mathcal{H}\to\mathcal{H}\] be the spectral projection onto the occupied subspace: \[\mathcal P(k)=\sum_{a=1}^{N_{\mathrm{occ}}}|u_a(k)\rangle\langle u_a(k)|\] for any local orthonormal basis of occupied states. The image of \(\mathcal P(k)\) is \(E_k\). Smoothness of \(\mathcal P(k)\) is the coordinate-free way to say that the occupied subspaces vary smoothly with \(k\).

The principal frame bundle of occupied states

The Bloch vector bundle has an associated principal bundle of orthonormal frames. Define \[P_{\mathrm{occ}}\to\mathrm{BZ}\] by declaring the fibre over \(k\) to be \[(P_{\mathrm{occ}})_k = \{\text{unitary isomorphisms }p:\mathbb{C}^{N_{\mathrm{occ}}}\to E_k\}.\] Equivalently, a point of \((P_{\mathrm{occ}})_k\) is an ordered orthonormal frame \[(|u_1(k)\rangle,\ldots,|u_{N_{\mathrm{occ}}}(k)\rangle)\] of the occupied subspace. The structure group is \[G=U(N_{\mathrm{occ}}),\] acting on the right by changing the frame: \[p\cdot U:=p\circ U, \qquad U\in U(N_{\mathrm{occ}}).\] In basis language this says \[|u_a\rangle\mapsto |u_b\rangle U^b{}_a.\]

The Bloch bundle is recovered as the associated vector bundle \[\boxed{E\cong P_{\mathrm{occ}}\times_{U(N_{\mathrm{occ}})}\mathbb{C}^{N_{\mathrm{occ}}}.}\] The isomorphism is explicit: \[[p,v]\longmapsto p(v)\in E_k.\] This formula says that a frame \(p\) converts a coordinate vector \(v\in\mathbb{C}^{N_{\mathrm{occ}}}\) into the actual occupied state \(p(v)\in E_k\). If the frame is changed by \(U\), then \(v\) is changed by \(U^{-1}\) in the quotient, so the actual vector \(p(v)\) is unchanged.

Physics translation The principal bundle \(P_{\mathrm{occ}}\) is the bundle of choices of occupied-band basis. The associated vector bundle \(E\) is the bundle whose fibres are the occupied Hilbert spaces themselves. Local Bloch eigenvectors are local frames; their gauge freedom is the right action of \(U(N_{\mathrm{occ}})\).

Local frames are local choices of eigenvectors

A local frame of the Bloch bundle is a local smooth choice of occupied eigenvectors \[|u_1(k)\rangle,\ldots,|u_{N_{\mathrm{occ}}}(k)\rangle.\] On an overlap of two patches, two choices of eigenvectors differ by a unitary matrix: \[|u_a^{(j)}(k)\rangle=|u_b^{(i)}(k)\rangle\,g_{ij}^{ba}(k),\] where \[g_{ij}(k)\in U(N_{\mathrm{occ}}).\] Thus band gauge freedom is literally frame freedom in a vector bundle.

For a single occupied band, \(N_{\mathrm{occ}}=1\), the structure group reduces to \(U(1)\) and a local eigenvector can be changed by a phase: \[|u(k)\rangle\mapsto e^{i\chi(k)}|u(k)\rangle.\] If the line bundle has nonzero Chern number, no smooth nonvanishing eigenvector can be chosen globally over the entire Brillouin zone.

Berry connection as the projected derivative

The formula \[\mathcal A_{ab}=i\langle u_a|d u_b\rangle\] is not an arbitrary definition. It is the local expression of a natural connection on the occupied subbundle.

The ambient Hilbert bundle \[\mathrm{BZ}\times\mathcal{H}\to\mathrm{BZ}\] is trivial, so it has an ordinary flat derivative \(d\). However, if \(s(k)\in E_k\) is an occupied-band section, then \(ds\) need not remain inside \(E_k\); differentiating an occupied state can produce components in the unoccupied subspace. To get a derivative intrinsic to the occupied bundle, project back: \[\boxed{\nabla s:=\mathcal P\,ds.}\] This is called the Grassmann connection on the subbundle \(E\subset\mathrm{BZ}\times\mathcal{H}\).

Apply this to a local occupied frame \(|u_b\rangle\). Since \[\mathcal P=\sum_a |u_a\rangle\langle u_a|,\] we get \[\nabla |u_b\rangle = \mathcal P\,d|u_b\rangle = \sum_a |u_a\rangle\langle u_a|d u_b\rangle.\] Compare this with the general local expression for a connection, \[\nabla e_b=e_a A^a{}_b.\] With \(e_a=|u_a\rangle\), the connection matrix is therefore \[\boxed{A^a{}_b=\langle u_a|d u_b\rangle.}\] Because the frame is orthonormal, \[d\langle u_a|u_b\rangle=0,\] so \[\langle d u_a|u_b\rangle+\langle u_a|d u_b\rangle=0.\] This implies \[A^\dagger=-A,\] so \(A\) is \(\mathfrak u(N_{\mathrm{occ}})\)-valued, as a unitary connection should be.

Physics often uses a Hermitian Berry connection \[\boxed{\mathcal A:=iA,\qquad \mathcal A_{ab}=i\langle u_a|d u_b\rangle.}\] This is the same connection written with a different convention. The mathematical convention uses anti-Hermitian matrices; the physics convention often inserts a factor of \(i\) so that the components are real in the one-band case.

In coordinates, \[\mathcal A_{ab}=i\langle u_a(k)|\partial_{k_i}u_b(k)\rangle\,dk_i.\] For a single occupied band, \[\mathcal A=i\langle u(k)|d u(k)\rangle.\]

Gauge transformation of the Berry connection

Let a local occupied frame be changed by \[|u_a\rangle\mapsto |u'_a\rangle=|u_b\rangle U^b{}_a(k), \qquad U(k)\in U(N_{\mathrm{occ}}).\] In the anti-Hermitian convention \(A_{ab}=\langle u_a|d u_b\rangle\), the connection transforms as \[\boxed{A' = U^{-1}AU+U^{-1}dU.}\] This is the same formula as for a vector-bundle connection.

For a single band, write \[|u'\rangle=e^{i\chi}|u\rangle.\] Then \[\begin{align*} \mathcal A' &=i\langle u'|d u'\rangle \\ &=i\langle u|e^{-i\chi}d(e^{i\chi}|u\rangle) \\ &=i\langle u|(i\,d\chi)|u\rangle+i\langle u|du\rangle \\ &=\mathcal A-d\chi. \end{align*}\] The Berry connection is not gauge-invariant. It depends on the phase convention for the local eigenvector. The Berry curvature is gauge-invariant for a line bundle: \[\mathcal F=d\mathcal A.\] For several occupied bands, the curvature in the anti-Hermitian convention is \[F=dA+A\wedge A,\] and in the common physics convention it is often written \[\mathcal F=d\mathcal A-i\mathcal A\wedge\mathcal A.\] The trace \(\mathrm{Tr}\mathcal F\) is gauge-invariant and enters the first Chern number.

Chern number of occupied bands

For a single occupied band over a two-dimensional Brillouin zone, \[\boxed{C=\frac{1}{2\pi}\int_{\mathrm{BZ}}\mathcal F\in\mathbb{Z}.}\] When \(\mathrm{BZ}\cong T^2\), this is an integral over the momentum-space torus.

For multiple occupied bands, the first Chern number of the occupied bundle is \[\boxed{C=\frac{1}{2\pi}\int_{\mathrm{BZ}}\mathrm{Tr}\mathcal F\in\mathbb{Z},}\] with the convention that \(\mathcal F\) is the Hermitian/real physics curvature. In the anti-Hermitian mathematical convention, the same formula is \[C=\int_{\mathrm{BZ}}\frac{\mathrm{Tr}F}{2\pi i}.\]

Real space versus momentum space The base space of the Bloch bundle is momentum space, usually the Brillouin zone. The physical sample lives in real space. For a two-dimensional crystal, real space is two-dimensional and the Brillouin zone is also two-dimensional, but they are different spaces.

Why the Chern number gives Hall conductivity: TKNN in brief

For a clean noninteracting two-dimensional band insulator at zero temperature, linear response theory gives the Hall conductivity. The starting point is the Kubo formula. For simplicity, consider nondegenerate bands and a completely filled set of occupied bands: \[\sigma_{xy} = -i\hbar e^2\int_{\mathrm{BZ}}\frac{d^2k}{(2\pi)^2} \sum_{n\in\mathrm{occ}}\sum_{m\in\mathrm{emp}} \frac{ \langle u_n|v_x|u_m\rangle\langle u_m|v_y|u_n\rangle - (x\leftrightarrow y) }{(E_n-E_m)^2}.\] Here \[v_i=\frac{1}{\hbar}\frac{\partial H}{\partial k_i}.\] Different sign conventions for electron charge and Berry curvature can move an overall sign; the invariant content is the integer multiplying \(e^2/h\).

The key identity comes from differentiating the eigenvalue equation \[H(k)|u_n(k)\rangle=E_n(k)|u_n(k)\rangle.\] For \(m\neq n\), \[\langle u_m|\partial_{k_i}H|u_n\rangle = (E_n-E_m)\langle u_m|\partial_{k_i}u_n\rangle.\] Substituting this into the Kubo formula cancels the energy denominators. The sum over empty states can then be rewritten using the completeness relation \[\sum_{m\in\mathrm{all}}|u_m\rangle\langle u_m|=I.\] After the antisymmetrization in \(x\) and \(y\), the occupied-state terms organize into the Berry curvature. For a single occupied band, \[\mathcal F_{xy} = \partial_{k_x}\mathcal A_y- \partial_{k_y}\mathcal A_x = i\left( \langle \partial_{k_x}u|\partial_{k_y}u\rangle - \langle \partial_{k_y}u|\partial_{k_x}u\rangle \right).\] Thus \[\sigma_{xy} = \frac{e^2}{h}\frac{1}{2\pi}\int_{\mathrm{BZ}}\mathcal F.\] For several occupied bands, \[\boxed{\sigma_{xy}=\frac{e^2}{h}\,C, \qquad C=\frac{1}{2\pi}\int_{\mathrm{BZ}}\mathrm{Tr}\mathcal F\in\mathbb{Z}.}\]

Why this is robust The Kubo formula begins as a detailed expression involving energies and matrix elements. The gap lets it collapse to the curvature of the occupied bundle. The integral of that curvature is a Chern number, hence an integer. Smooth perturbations cannot change this integer unless the occupied and unoccupied bands touch, because only then can the vector bundle itself change topology.

This is the geometric core of the integer quantum Hall effect and Chern insulators. Disorder and interactions require more sophisticated formulations, but the clean band-theory statement already shows why topology enters transport.

Two-band model and degree of a map

A common two-band Hamiltonian has the form \[H(k)=\bm d(k)\cdot\bm\sigma.\] If \(\bm d(k)\neq0\) for all \(k\in\mathrm{BZ}\), define \[\widehat{\bm d}(k)=\frac{\bm d(k)}{|\bm d(k)|}.\] Then \[\widehat{\bm d}:\mathrm{BZ}\to S^2.\] For a two-dimensional Brillouin zone, this is a map from the torus \(T^2\) to the unit sphere.

The unit sphere has a normalized area form \[\omega_{S^2} = \frac{1}{4\pi} \left( n_1\,dn_2\wedge dn_3 +n_2\,dn_3\wedge dn_1 +n_3\,dn_1\wedge dn_2 \right),\] where \((n_1,n_2,n_3)\) are the coordinate functions restricted to \(S^2\subset\mathbb{R}^3\). It is normalized by \[\int_{S^2}\omega_{S^2}=1.\] Pull this two-form back along \(\widehat{\bm d}\). Since \[d\widehat{\bm d} = \frac{\partial\widehat{\bm d}}{\partial k_x}dk_x + \frac{\partial\widehat{\bm d}}{\partial k_y}dk_y,\] a direct wedge-product computation gives \[\widehat{\bm d}^{\,*}\omega_{S^2} = \frac{1}{4\pi} \widehat{\bm d}\cdot \left( \frac{\partial\widehat{\bm d}}{\partial k_x} \times \frac{\partial\widehat{\bm d}}{\partial k_y} \right) dk_x\wedge dk_y.\]

Theorem 3 (Degree formula). Let \(M\) and \(N\) be compact connected oriented smooth \(n\)-manifolds, and let \(f:M\to N\) be smooth. If \(\eta\) is any top-degree form on \(N\), then \[\int_M f^*\eta=\deg(f)\int_N\eta.\] The integer \(\deg(f)\) is the degree of \(f\).

Applying this theorem to \[f=\widehat{\bm d}:\mathrm{BZ}\to S^2, \qquad \eta=\omega_{S^2},\] we obtain \[\int_{\mathrm{BZ}}\widehat{\bm d}^{\,*}\omega_{S^2} =\deg(\widehat{\bm d}).\] Therefore \[\boxed{ \deg(\widehat{\bm d}) = \frac{1}{4\pi}\int_{\mathrm{BZ}} \widehat{\bm d}\cdot \left( \partial_{k_x}\widehat{\bm d}\times \partial_{k_y}\widehat{\bm d} \right)d^2k. }\] This integer is the oriented wrapping number of the map \(\widehat{\bm d}:T^2\to S^2\).

For the eigenline whose spin is aligned with \(\widehat{\bm d}\), the first Chern number is \[C_+=\deg(\widehat{\bm d})\] with the Berry-curvature convention used above. For the lower band of the Hamiltonian \(H=\bm d\cdot\bm\sigma\), the spin is anti-aligned with \(\widehat{\bm d}\), so many conventions give \[C_-=-\deg(\widehat{\bm d}).\] If one writes the Hamiltonian with the opposite sign, or defines the occupied eigenline differently, this sign flips. The invariant geometric statement is that the band Chern number is the degree of the map to the Bloch sphere, up to the convention-fixed sign.

Chern number as wrapping number The integrand \[\widehat{\bm d}\cdot (\partial_{k_x}\widehat{\bm d}\times\partial_{k_y}\widehat{\bm d})\] is the Jacobian measuring how oriented area in the Brillouin zone maps to oriented area on the Bloch sphere. Dividing by \(4\pi\) normalizes the total area of the unit sphere to one. The integral counts how many times the Brillouin-zone torus covers the sphere, with orientation.

Relation to Gauge Theory and Topological Order

Gauge theory in bundle language

The bundle-level definition of a gauge theory is: \[\text{principal }G\text{-bundle }P\to M \quad + \quad \text{connection }A.\] Matter fields in representation \(\rho:G\to \mathop{\mathrm{GL}}(V)\) are sections of \[P\times_G V.\] The field strength is the curvature \[F=dA+A\wedge A.\] Wilson loops are holonomies: \[W_R(\gamma)=\mathrm{Tr}_R\,\mathcal P\exp\left(-\oint_\gamma A\right).\]

Flat bundles and topological sectors

A connection is flat if \[F=0.\] Flat does not necessarily mean globally trivial. A flat connection can have nontrivial holonomy around noncontractible loops.

For example, on a torus \(T^2\), there are two fundamental noncontractible cycles. A flat \(\mathbb{Z}_2\) gauge field can have holonomy \(\pm1\) around each cycle. Thus there are four sectors: \[(+,+),\quad (+,-),\quad (-,+),\quad (-,-).\] This is the continuum/bundle intuition behind why \(\mathbb{Z}_2\) gauge theories and the toric code have topological sectors on a torus.

Connection to constrained Hilbert spaces The constrained-Hilbert-space viewpoint says a local constraint can behave like Gauss’s law. The bundle viewpoint says that gauge sectors are global data of gauge fields, such as holonomies around noncontractible cycles. These are two complementary descriptions of the same kind of physics.

Relation to Projective Representations and Spin

Spin-\(1/2\) is naturally a projective representation of \(SO(3)\) but a linear representation of \(SU(2)\). Bundle language gives a geometric version of the same idea.

Let \(M\) be an oriented Riemannian \(n\)-manifold. At each point, choose an oriented orthonormal frame of \(T_pM\). The collection of all such frames forms a principal \(SO(n)\)-bundle: \[P_{SO}(M)\to M.\] Spinors require lifting this principal bundle to a principal \(Spin(n)\)-bundle: \[P_{Spin}(M)\to M.\] The obstruction to doing this is the second Stiefel–Whitney class \[w_2(TM)\in H^2(M;\mathbb{Z}_2).\] If \[w_2(TM)=0,\] then a spin structure exists.

Physical memory rule Projective representations become linear after passing to a suitable covering group: \[SO(3)\leftarrow SU(2), \qquad SO(n)\leftarrow Spin(n).\] Spin structures are the bundle version of making this lift consistently over all of spacetime or space.

This topic is not needed for the first calculation of a Chern number, but it is a useful bridge between projective representations, topology, and geometry.

What Chern–Simons Theory and Anomalies Have to Do With This

This section is optional on a first pass. It is included only to place later PHYS 374 topics in context.

Chern class versus Chern–Simons action

Do not confuse these:

Concept Meaning
Chern class / Chern number Topological invariant of a vector bundle, built from curvature. Essential for Berry curvature and Chern insulators.
Chern–Simons action A topological field theory action in odd spacetime dimensions, built from a connection. Important for quantum Hall effective field theories and edge physics, but not required for first learning fibre bundles.

For a \(U(1)\) connection in \(2+1\) dimensions, a Chern–Simons term looks like \[S_{CS}[A]=\frac{k}{4\pi}\int A\wedge dA.\] This uses the language of connections and forms, but it is not the starting point for learning bundles.

Anomalies

A rough bundle-language statement of a ’t Hooft anomaly is:

There is an obstruction to defining the partition function consistently and gauge-invariantly for all background gauge bundles and gauge transformations.

This is conceptually downstream from bundles, connections, and gauge transformations. It is not required for understanding what a vector bundle or a Berry connection is.

Two-Day Minimal Study Plan

Day 1: Manifold and forms recovery

You should be able to explain the following without looking:

  1. A topological manifold is locally homeomorphic to open subsets of \(\mathbb{R}^n\).

  2. A smooth manifold has smooth coordinate transition maps.

  3. A tangent vector at \(p\) is a directional derivative operator.

  4. A cotangent vector at \(p\) is a linear functional on \(T_pM\).

  5. \(dx^\mu(\partial/\partial x^\nu)=\delta^\mu_\nu\) because \(dx^\mu=d(x^\mu)\).

  6. A \(k\)-form eats \(k\) tangent vectors and is antisymmetric.

  7. Pullback replaces \(y^i\) by \(f^i(x)\) and \(dy^i\) by \(d(f^i(x))\).

  8. A flow \(\Phi_t\) follows a vector field, and \(\mathcal{L}_XT=\left.\frac{d}{dt}\right|_{0}\Phi_t^*T\) is the infinitesimal pullback along that flow.

Day 2: Bundles and connections

You should be able to explain:

  1. A fibre bundle is a map \(\pi:E\to B\) locally isomorphic to \(U_i\times F\) over some open cover.

  2. The definition requires the existence of a trivializing open cover, not that every open set trivializes.

  3. A vector bundle is a fibre bundle with fibre \(\mathbb{R}^r\) or \(\mathbb{C}^r\) and linear transition functions.

  4. \(E\) is not automatically \(M\times\mathbb{R}^{\dim M}\); that is only the trivial bundle of rank \(\dim M\).

  5. \(TM\), \(T^*M\), and \(\Lambda^kT^*M\) are vector bundles over \(M\).

  6. A vector field is a section of \(TM\).

  7. A one-form is a section of \(T^*M\).

  8. Transition functions glue local trivializations.

  9. A connection differentiates sections by comparing nearby fibres.

  10. Locally \(\nabla=d+A\).

  11. Curvature is \(F=dA+A\wedge A\).

  12. For a complex line bundle over a closed surface, \((1/2\pi)\int F\in\mathbb{Z}\).

  13. A 2D Bloch bundle has base \(\mathrm{BZ}\cong T^2\), not real space.

Study Checkpoints

Local product does not mean global product

The Möbius line bundle and cylinder are both locally \(U\times\mathbb{R}\) over \(S^1\). They differ globally. The difference is encoded in transition functions.

Rank is not base dimension

A rank-\(r\) vector bundle over an \(n\)-dimensional base has \(r\)-dimensional fibres. The tangent bundle has rank \(n\), but a line bundle over \(S^2\) has rank \(1\), and a Bloch bundle over \(T^2\) can have rank equal to the number of occupied bands.

A differential form is not just an integral sign

A one-form is a covector field. It can be integrated over a curve because it eats the tangent vector to the curve and produces a scalar integrand.

Lie derivative is not the same as connection

\(\mathcal{L}_X\) differentiates by flow. \(\nabla_X\) differentiates by a chosen comparison rule between fibres. On arbitrary vector bundles, \(\nabla_X\) is the more general operation once a connection is chosen.

Gauge potentials are local

A gauge potential \(A_i\) is usually a local expression for a connection in patch \(U_i\). On overlaps, local expressions are related by gauge transformations.

Exercises With Short Solutions

Exercise 1 (Dual basis identity). Show that \[dx^\mu\left(\frac{\partial}{\partial x^\nu}\right)=\delta^\mu_\nu.\]

Solution. Since \(dx^\mu=d(x^\mu)\), \[dx^\mu\left(\frac{\partial}{\partial x^\nu}\right)=\frac{\partial x^\mu}{\partial x^\nu}=\delta^\mu_\nu.\]

Exercise 2 (Pullback of a one-form). Let \(f:\mathbb{R}^2\to\mathbb{R}^2\) be \[f(u,v)=(x,y)=(u^2+v,uv).\] Compute \(f^*(x\,dy)\).

Solution. Replace \(x\) by \(u^2+v\) and \(dy\) by \(d(uv)=v\,du+u\,dv\): \[f^*(x\,dy)=(u^2+v)(v\,du+u\,dv).\]

Exercise 3 (A bundle from transition functions). Let \(B=S^1\) and cover it by two arcs \(U_1,U_2\) with two disconnected overlaps. Let the fibre be \(\mathbb{R}\) and take transition function \(+1\) on one overlap component and \(-1\) on the other. Explain what bundle this gives.

Solution. This gluing reverses the fibre after going around the circle, so it gives the Möbius line bundle.

Exercise 4 (Section of a trivial bundle). Let \(E=M\times\mathbb{R}^r\). Show that a section of \(E\) is the same as a smooth function \(\psi:M\to\mathbb{R}^r\).

Solution. A section \(s:M\to M\times\mathbb{R}^r\) must satisfy \(\pi\circ s=\mathrm{id}_M\), so it has the form \[s(p)=(p,\psi(p))\] for a smooth \(\mathbb{R}^r\)-valued function \(\psi\).

Exercise 5 (Connection gauge transformation). Assume \(\psi_i=g_{ij}\psi_j\) and \(D_i\psi_i=g_{ij}D_j\psi_j\), with \(D_i=d+A_i\). Derive the transformation law for \(A\).

Solution. The computation in the connection section gives \[(dg_{ij})+A_i g_{ij}=g_{ij}A_j,\] so \[A_j=g_{ij}^{-1}A_i g_{ij}+g_{ij}^{-1}dg_{ij}.\]

Exercise 6 (Curvature for \(U(1)\)). Let \(A=A_xdx+A_ydy\) on \(\mathbb{R}^2\). Compute \(F=dA\).

Solution. \[\begin{align*} F&=d(A_xdx+A_ydy)\\ &=dA_x\wedge dx+dA_y\wedge dy\\ &=(\partial_y A_xdy)\wedge dx+(\partial_x A_y dx)\wedge dy\\ &=(\partial_x A_y-\partial_y A_x)dx\wedge dy. \end{align*}\]

Exercise 7 (Flow of a simple vector field). Let \(M=\mathbb{R}^2\) and \[X=x\frac{\partial}{\partial x}-y\frac{\partial}{\partial y}.\] Find the flow \(\Phi_t(x,y)\).

Solution. The flow equations are \[\dot x(t)=x(t), \qquad \dot y(t)=-y(t).\] With initial condition \((x(0),y(0))=(x,y)\), the solution is \[\Phi_t(x,y)=(e^tx,e^{-t}y).\]

Exercise 8 (Lie derivative from pullback). Let \(M=\mathbb{R}\) and \(X=d/dx\), whose flow is \(\Phi_t(x)=x+t\). For the function \(f(x)=x^2\), compute \(\mathcal{L}_Xf\) using \[\mathcal{L}_Xf=\left.\frac{d}{dt}\right|_0\Phi_t^*f.\]

Solution. Since \[\Phi_t^*f(x)=f(x+t)=(x+t)^2,\] we get \[\mathcal{L}_Xf=\left.\frac{d}{dt}\right|_0(x+t)^2=2x.\] This agrees with \(X[f]=d(x^2)/dx=2x\).

Exercise 9 (Lie derivative of a one-form). Let \(X=X^\mu\partial_\mu\) and \(\alpha=\alpha_\mu dx^\mu\). Use Cartan’s formula to verify \[(\mathcal{L}_X\alpha)_\mu=X^\nu\partial_\nu\alpha_\mu+\alpha_\nu\partial_\mu X^\nu.\]

Solution. Compute \(\iota_X\alpha=X^\nu\alpha_\nu\), so \[d(\iota_X\alpha)=\partial_\mu(X^\nu\alpha_\nu)dx^\mu.\] Also \[d\alpha=\partial_\mu\alpha_\nu dx^\mu\wedge dx^\nu,\] and contracting with \(X\) gives the remaining antisymmetric terms. Combining them cancels the unwanted term and gives \[(\mathcal{L}_X\alpha)_\mu=X^\nu\partial_\nu\alpha_\mu+\alpha_\nu\partial_\mu X^\nu.\]

Exercise 10 (Why the BZ is \(T^2\)). Explain why a two-dimensional Brillouin zone is topologically a torus.

Solution. Crystal momenta differing by reciprocal lattice vectors are identified: \[(k_x,k_y)\sim(k_x+G_x,k_y), \qquad (k_x,k_y)\sim(k_x,k_y+G_y).\] Thus each momentum direction is a circle, so the two-dimensional BZ is \(S^1\times S^1=T^2\).

Formula Sheet

Concept Formula
Topological manifold Locally homeomorphic to open subsets of \(\mathbb{R}^n\)
Smooth manifold Coordinate transition maps are smooth
Tangent basis \(\partial_\mu=\partial/\partial x^\mu\)
Cotangent basis \(dx^\mu(\partial_\nu)=\delta^\mu_\nu\)
One-form \(\alpha=\alpha_\mu dx^\mu\)
Exterior derivative \(d^2=0\)
Pullback of coordinate one-form \(f^*(dy^i)=d(f^i)=\frac{\partial f^i}{\partial x^a}dx^a\)
Flow equation \(\frac{d}{dt}\Phi_t(p)=X_{\Phi_t(p)}\), \(\Phi_0=\mathrm{id}_M\)
General Lie derivative \(\mathcal{L}_XT=\left.\frac{d}{dt}\right|_{0}\Phi_t^*T\)
Lie derivative of function \(\mathcal{L}_Xf=X^\mu\partial_\mu f\)
Lie derivative of vector field \(\mathcal{L}_XY=[X,Y]\)
Cartan formula \(\mathcal{L}_X\omega=\iota_Xd\omega+d\iota_X\omega\)
Fibre bundle \(\pi:E\to B\), locally \(\pi^{-1}(U_i)\cong U_i\times F\)
Vector bundle Fibre \(\mathbb{R}^r\) or \(\mathbb{C}^r\), transition functions in \(\mathop{\mathrm{GL}}(r)\)
Real line bundle Rank-\(1\) real vector bundle; fibre \(\mathbb{R}\); metric transition functions in \(O(1)=\{\pm1\}\)
Section \(s:B\to E\), \(\pi\circ s=\mathrm{id}_B\)
Transition functions \(g_{ij}:U_i\cap U_j\to G\), \(g_{ij}g_{jk}=g_{ik}\)
Connection \(\nabla=d+A\) locally
Gauge transformation \(A_j=g_{ij}^{-1}A_i g_{ij}+g_{ij}^{-1}dg_{ij}\)
Curvature \(F=dA+A\wedge A\)
\(U(1)\) curvature \(F=dA\)
First Chern number \(C=\frac{1}{2\pi}\int_\Sigma F\in\mathbb{Z}\)
Berry connection \(\mathcal A_{ab}=i\langle u_a|d u_b\rangle\)
Berry curvature \(\mathcal F=d\mathcal A\) for one occupied band
2D BZ \(\mathrm{BZ}\cong T^2=S^1\times S^1\)

Additional formulas added in the expanded version

\[\begin{align*} \mathop{\mathrm{Ad}}_gX &= gXg^{-1}\quad \text{for matrix Lie groups},\\ \nabla_X(e\psi)&=e\bigl(X[\psi]+A(X)\psi\bigr),\\ \nabla_{\partial_\mu}\partial_\nu&=\Gamma^\lambda{}_{\mu\nu}\partial_\lambda,\\ (\nabla_\mu Y)^\lambda&=\partial_\mu Y^\lambda+\Gamma^\lambda{}_{\mu\nu}Y^\nu,\\ F(X,Y)s&=\nabla_X\nabla_Ys-\nabla_Y\nabla_Xs-\nabla_{[X,Y]}s,\\ F_{\mu\nu}&=\partial_\mu A_\nu-\partial_\nu A_\mu+[A_\mu,A_\nu],\\ H^k_{\mathrm{dR}}(M)&=\frac{\{\omega\in\Omega^k(M):d\omega=0\}}{\{d\eta:\eta\in\Omega^{k-1}(M)\}},\\ C_1(L;\Sigma)&=\frac{1}{2\pi}\int_\Sigma F\in\mathbb{Z}. \end{align*}\]

Final Roadmap

For your current purpose, learn in this order:

  1. Topological manifolds and smooth coordinate changes.

  2. \(T_pM\), \(T_p^*M\), and \(dx^\mu(\partial_\nu)=\delta^\mu_\nu\).

  3. Differential forms as covariant antisymmetric tensors, not just integration notation.

  4. Pullback of forms and pushforward of vectors.

  5. Lie derivatives as flow-based derivatives.

  6. Topological fibre bundles and local trivializations.

  7. Vector bundles, sections, frames, and transition functions.

  8. Connections and curvature.

  9. First Chern number.

  10. Bloch bundle over the Brillouin zone.

Only after this should you worry about Chern–Simons actions, anomalies, index theorems, or K-theory.