1. Introduction
Studying the dynamical properties of discontinuous hyperbolic dynamical systems is important for understanding many relevant systems (such as billiards and optimal control theory) but it is difficult if the system is multi-dimensional. A first step towards addressing this problem can be to divide the problem into two cases: the piecewise expanding case and the piecewise contracting case. Here we address the case of piecewise contracting systems. Because the attractor can differ drastically depending on the kind of piecewise contraction, a first approach is to identify a large class of piecewise contractions that exhibit similar dynamical properties.
In recent decades, the properties of the attractor of piecewise contractions have been studied under different settings (see [Reference Bruin and Deane3, Reference Catsigeras, Guiraud and Meyroneinc7, Reference Catsigeras, Guiraud, Meyroneinc and Ugalde8, Reference Nogueira and Pires21, Reference Nogueira, Pires and Rosales22]). To begin with, Bruin and Deane (see [Reference Bruin and Deane3]) studied families of piecewise linear contractions on the complex plane ${\mathbb C}$ and proved that, for almost all parameters, each orbit is asymptotically periodic.
In the case of one-dimensional piecewise contractions, Nogueira and Pires, in [Reference Nogueira and Pires21], studied globally injective piecewise contractions on a half closed unit interval $[0,1)$ with partition of continuity consisting of n elements and concluded that such maps have at most n periodic orbits: that is, the attractor can be a Cantor set or a collection of at most n periodic orbits or a union of a Cantor set and at most $n-1$ periodic orbits. In particular, when the attractor consists of exactly n periodic orbits, the map is asymptotically periodic (the limit set of every element in the domain is a periodic orbit). They also proved that every such map on n intervals is topologically conjugate to a piecewise linear contraction of n intervals whose slope in absolute value equals $1/2$ . We would like to emphasise that this result does not imply that any two piecewise contractions close enough to each other are topologically conjugate to each other. Hence, this result is not a stability result for the class of piecewise contractions. In [Reference Gaivão and Nogueira12], piecewise increasing contractions on n intervals are considered and it is proved that the maximum number of periodic orbits is n. The authors prove that the collection parameters that give a piecewise contraction with non-asymptotically periodic orbits is a Lebesgue null measure set whose Hausdorff dimension is large or equal to n. In [Reference Nogueira, Pires and Rosales22], Nogueira, Pires and Rosales proved that generically (under ${\mathcal C}^0$ topology) globally injective piecewise contractions of n intervals are asymptotically periodic and have at least one and at most n internal periodic orbits (such orbits persist under a sufficiently small ${\mathcal C}^0$ -perturbation; refer to [Reference Nogueira, Pires and Rosales22] for precise definition). In [Reference Nogueira, Pires and Rosales24], the authors prove that almost all translations within a small neighbourhood of a $\lambda$ -affine contraction are asymptotically periodic. In [Reference Calderon, Catsigeras and Guiraud4], Calderon, Catsigeras and Guiraud proved that the attractor of a piecewise injective contraction consists of finitely many periodic orbits and minimal Cantor sets. In [Reference Janson and Oberg16, Reference Laurent and Nogueira20], the authors study symbolic coding associated to piecewise contractions on the unit interval and prove that they are related to the symbolic coding of rotations of the circle.
In higher dimensions, Catsigeras and Budelli (see [Reference Catsigeras and Budelli6]) proved that a finite dimensional piecewise contracting system with separation property (injective on the entire domain except for the discontinuity set) generically (under a topology that is finer than the topology we use for proving openness and coarser than the topology we use for proving density) exhibits at most a finite number of periodic orbits as its attractor. Here, we obtain similar, actually stronger, results without assuming the separation property.
In any (finite) dimension, in [Reference Catsigeras, Guiraud, Meyroneinc and Ugalde8], the authors show that if the set of discontinuities and the attractor of a piecewise contraction are mutually disjoint, then the attractor consists of finitely many periodic orbits. This result is a by-product of our arguments as well. In [Reference Gambaudo and Tresser14], the authors study symbolic dynamics associated to piecewise contractions (referred to as quasi-contractions in that article) and categorize its association into different kinds of circle rotations.
In the month after we submitted this article, we noticed a new preprint [Reference Gaivão and Pires13] in which the authors provide a measure-theoretical criterion for asymptotic periodicity of a parametrized family of locally bi-Lipschitz piecewise contractions on a compact metric space.
Nevertheless, the occurrence of chaotic behaviour in such systems has been addressed in the literature, and [Reference Kruglikov and Rypdal19] provides an example of a piecewise affine contracting map with positive entropy. The presence of a Cantor set in the attractor has also been studied rigorously. Examples of such maps for one dimension are given in [[Reference Catsigeras, Guiraud, Meyroneinc and Ugalde8], Example 4.3] and [Reference Coutinho9], and for three dimensions are given in [[Reference Catsigeras, Guiraud, Meyroneinc and Ugalde8], Example 5.1], where it is also proved that such a piecewise contraction turns out to have positive topological entropy. In [Reference Pires25], it is proved that, given a minimal interval exchange transformation with any number of discontinuities, there exists an injective piecewise contraction with Cantor attractors topologically semi-conjugate to it and, conversely, that piecewise contractions with Cantor attractors are topologically semi-conjugate to topologically transitive interval exchange transformations. Additionally, in [Reference Pires25] (respectively, in [Reference Catsigeras, Guiraud and Meyroneinc7]), it is proved that the complexity (the complexity function of the itineraries of orbits; refer to [Reference Catsigeras, Guiraud and Meyroneinc7, Reference Pires25]) of a globally injective piecewise contraction (respectively, piecewise contraction with separation property) on the interval is eventually affine (which is eventually constant in the case of piecewise contractions with no Cantor attractors).
The global picture presented by the above articles is that the Cantor attractors are rare, but can exist in exceptional (but constructible) cases, and many such explicit examples have been rigorously studied.
Piecewise contractions have also been used to study some models of outer billiards (see [Reference Del Magno, Gaivao and Gutkin10, Reference Gaivão11, Reference Jeong17]), where a billiard map is constructed such that it is a piecewise contraction, and so the properties of piecewise contractions are relevant in the study of such billiard maps.
Note that, in the above papers (and in this article), maps with only finitely many partition elements are considered.
The layout of this article is as follows.
Section 2 is dedicated to definitions, settings and the statement of results. In §3, we prove that the set of piecewise contractions with attractor disjoint from the set of discontinuities is open, under a rather coarse topology, and that if the maps are also piecewise injective (and hence not necessarily globally injective), then they are topologically stable. In §4, we prove that piecewise contractions with the attractor disjoint from the set of discontinuities are dense, under a rather fine topology, among the piecewise injective smooth contractions. Finally, we have three appendices collecting some needed technical facts.
2. Settings and results
Throughout this article, we work with $(X,d_0)\subset ({\mathbb R}^d,d_0)$ , a compact subset of ${\mathbb R}^d$ , where $d\in {\mathbb N}$ and $d_0$ is the standard Euclidean metric on ${\mathbb R}^d$ . Under these settings, we define a piecewise contraction as follows.
Definition 2.1. (Piecewise contraction)
A map $f:X\to X$ is called a piecewise contraction if $\overline {f(X)}\subset \mathring {X}$ and there exist $m\in {\mathbb N}$ and a collection ${\boldsymbol P}(f)=\{P_i: P_i=\mathring { P_i}\}_{i=1}^m$ of subsets of X such that:
-
• $X=\bigcup _{i=1}^{m}\overline P_i$ , where $ P_i\cap P_j=\emptyset $ whenever $i\neq j$ ;
-
• $f|_{P_i}$ is a uniform contraction, that is, there exists $\unicode{x3bb} \in (0,1)$ such that, for all $i\in \{1,2,\ldots ,m\}$ and for any $x,y\in P_i$ ,
$$ \begin{align*} d_0(f(x),f(y))\leq \unicode{x3bb} d_0(x,y); \end{align*} $$ -
• there exists a partition $\{\tilde P_i\}$ , $\tilde P_i\cap \tilde P_j=\emptyset $ for $i\neq j$ , $P_i\subset \tilde P_i\subset \overline {P}_i$ , $X=\bigcup _{i=1}^{m}\tilde P_i$ such that $f|_{\tilde P_i}$ is continuous.
Here $\unicode{x3bb} $ is called the ‘contraction coefficient’ of f, ${\boldsymbol P}(f)$ is called the ‘partition of continuity’ and $\overline {P_i}$ and $\partial P_i$ represent the closure and boundary, respectively, of a partition element $P_i$ .
Remark 2.2. The third condition pertaining to the partition $\tilde P$ states that the values of f on the boundary of partition elements must be the limit of the values of f inside some elements, but no particular condition is imposed on which element.
For a piecewise contraction f with partition of continuity ${\boldsymbol P}(f)=\{P_1, P_2, \ldots ,P_m\}$ , we define $\partial {\boldsymbol P}(f)$ as the boundary of the partition ${\boldsymbol P}(f)$ given by
We denote by $\Delta (f)$ the union of the set of discontinuities of f with $\partial X$ . To avoid confusion in the choice of partition of continuity for a given piecewise contraction, we define the following.
Definition 2.3. (Maximal partition)
A partition ${\boldsymbol P}(f)$ is called the maximal partition of a piecewise contraction f if $\partial {\boldsymbol P}(f)=\Delta (f)$ .
Any partition of continuity of f is a refinement of the maximal partition. For a piecewise contraction with maximal partition ${\boldsymbol P}(f)=\{P_1^1,P_2^1,\ldots ,P_m^1\}$ , we define the partition ${\boldsymbol P}(f^n)= \{P_1^n,P_2^n,\ldots ,P_{m_n}^n\}$ , relative to the nth iterate $f^n$ of f, where, for every $k\in \{1,2,\ldots ,m_n\}$ ,
and where $i_j\in \{1,2,\ldots ,m\}$ for every $j\in \{0,1,\ldots ,n-1\}$ . Note that ${\boldsymbol P}(f)$ being the maximal partition of f does not imply that the partition ${\boldsymbol P}(f^n)$ is the maximal partition of $f^n$ .
Remark 2.4. Throughout this article, for a given piecewise contraction f, the partition that we consider is the maximal partition ${\boldsymbol P}(f)$ , whereas for its iterates $f^n$ we consider the partition ${\boldsymbol P}(f^n)$ , as defined in equation (2.1), which may not be the maximal partition of $f^n$ .
One of the goals of this article is to understand the attractor of piecewise contractions. On that note, we recall the standard definition of an attractor.
Definition 2.5. (Attractor)
For a piecewise contraction f with ${\boldsymbol P}(f)=\{P_i\}_{i=1}^m$ , the attractor is defined as $\Lambda (f)=\bigcap _{n\in {\mathbb N}}\overline {f^n(X)}$ .
When working with discontinuous maps, it is natural to talk of Markov maps (maps with Markov partitions), so we first give the following definition and then the definition of Markov maps in our settings.
Definition 2.6. (Stabilization of partition)
For a piecewise contraction f, we say that the maximal partition ${\boldsymbol P}(f)$ stabilizes if there exists $N\in {\mathbb N}$ such that, for all $P\in {\boldsymbol P}(f^N)$ , there exists $Q\in {\boldsymbol P}(f^N)$ such that $\overline {f^N(P)}\subset Q$ . We call the least such number, N, the ‘stabilization time’ of ${\boldsymbol P}(f)$ .
Definition 2.7. (Markov map)
A piecewise contraction whose maximal partition stabilizes is called a Markov map.
Now we are able to state our first result (proved in §3).
Theorem 2.8. A piecewise contraction f with $\Lambda (f)$ as the attractor and $\Delta (f)$ as the union of the set of discontinuities and $\partial X$ satisfies that $\Lambda (f)\cap \Delta (f)=\emptyset $ if and only if it is Markov. Moreover, the attractor of a Markov map consists of periodic orbits.
Remark 2.9. Note that, given a Markov map f, N its stabilization time and ${\boldsymbol P}(f^N)=\{Q_1,\ldots , Q_l\}$ the associated partition, then f induces a dynamics on $\Omega (f):=\{1,\ldots , l\}$ by the rule . See Lemma 3.2.
To state further results, we need to add some hypotheses on our system, and thus we give the following definition.
Definition 2.10. (Piecewise injective contraction)
A piecewise contraction f with partition ${\boldsymbol P}(f)=\{P_i\}, i\in \{1,2,\ldots ,m\}$ is called a piecewise injective contraction if, for all ${i\in \{1,2,\ldots ,m\}}$ , there exists $U_i=\mathring U_i\supset \overline {P_i}$ and an injective contraction $\tilde f_i: U_i\to {\mathbb R}^d$ such that $\tilde f_i|_{U_i}\supset P_i$ and $\tilde f_i|_{P_i}=f|_{P_i}$ .
For any piecewise contractions $f,g $ with partitions ${\boldsymbol P}(f)=\{P_1,P_2,\ldots ,P_m\}$ and ${\boldsymbol P}(g)=\{Q_1,Q_2,\ldots ,Q_m\}$ , respectively, we define
We define the distance $\rho $ as follows (see Lemma 3.1 for the proof that $\rho $ is a metric).
where $A=2 \operatorname {diam}(X)$ and $\mathrm {id}$ is the identity function, that is, $\mathrm {id}(x)=x$ .
Evidently, $\rho (f,g)\leq A$ for any piecewise contractions $f,g$ . Furthermore, notice that the metric (proved in Lemma 3.1) $\rho $ is essentially a distance between two tuples $(f,{\boldsymbol P}(f))$ and $(g,{\boldsymbol P}(g))$ for any two piecewise contractions $f,g$ , where ${\boldsymbol P}(f)$ and ${\boldsymbol P}(g)$ are the maximal partitions of f and g, respectively.
For an arbitrary $\sigma \in (0,1)$ , we define the distance $d_1$ as
Under this metric, we have the following result (proved in §3).
Theorem 2.11. The set of Markov piecewise contractions is open in the set of piecewise contractions under the metric $d_1$ . Moreover, any two Markov piecewise contractions $f,g$ close enough (with respect to $d_1$ ), stabilize at the same time.
To be able to state our stability result, we need to discuss the dynamics of the partition elements.
Definition 2.12. (Wandering set)
For a piecewise contraction $f:X\to X$ , a partition element $P\in {\boldsymbol P}(f)$ is called wandering if there exists $M\in {\mathbb N}$ such that $f^n(P)\cap P=\emptyset $ for all $n>M$ . The set ${\mathbb W}(f)\subset {\boldsymbol P}(f)$ , consisting of all wandering partition elements, is called the wandering elements set. We set $W(f)=\bigcup _{P\in {\mathbb W}(f)}P$ . (Note that this set is not the usual wandering set, which is much bigger.)
Definition 2.13. (Non-wandering set)
The complement $N{\mathbb W}(f)$ of the wandering elements set is called the non-wandering elements set: that is, $N{\mathbb W}(f)= \{P\in {\boldsymbol P}(f): P\not \in {\mathbb W}(f)\}$ . We set $NW(f)=\bigcup _{P\in N{\mathbb W}(f)}P$ .
Note that, for a Markov map, the set $N{\mathbb W}(f)$ corresponds to the non-wandering set of the dynamical system defined in Remark 2.9. Accordingly, our definition of $NW(f)$ should not be confused with the usual non-wandering set of f, which, for a Markov map, consists of finitely many points (the set of periodic points; see Theorem 2.8). Hereby, we state our definition of topological stability.
Definition 2.14. (Topological stability)
Let $\mathfrak {C}$ be contained in the set of piecewise contractions from $X\to X$ and let $d $ be a metric defined on the set of piecewise contractions. We say that $(\mathfrak {C},d)$ is topologically stable if, for every $f\in \mathfrak {C}$ , there exists a $\delta>0$ such that, for any piecewise contraction $g\in {\mathfrak C}$ with $d(f,g)<\delta $ , f is semi-conjugate to g and g is semi-conjugate to f on the respective non-wandering sets: that is, there exist continuous functions $H_1: {NW}(f)\to {NW}(g),\ H_2:{NW}(g)\to {NW}(f)$ such that $H_1\circ f= g\circ H_1$ , $f\circ H_2= H_2 \circ g$ and $H_1(NW(f))=NW(g)$ , $H_2(NW(g))=NW(f)$ .
This definition of topological stability is somewhat different from the standard definition (see [[Reference Katok and Hasselblatt18], Definition 2.3.5]). In general, topological stability is defined for homeomorphisms such that one of them is semi-conjugate to the other (whereas we have semi-conjugacies for both sides), but on the entire space. Here it is necessary to define it only on the non-wandering set which, in fact, is the only subset of the space that contributes to the long-time dynamics. We can interpret our definition as stating that the long-time dynamics of two such functions are qualitatively the same.
Theorem 2.15. The set of Markov piecewise injective contractions (as defined in Definitions 2.7, 2.10) is topologically stable in the $d_1$ topology.
The proof of the above theorem is given in §3. To state our result on density, we restrict ourselves to piecewise smooth contractions, which are defined as follows.
Definition 2.16. (Piecewise smooth contraction)
A piecewise injective contraction f with partition ${\boldsymbol P}(f)=\{P_i\}, i\in \{1,2,\ldots ,m\}$ and injective extensions $\tilde f_i:U_i\to {\mathbb R}^d$ is called a piecewise smooth contraction if, for every $i\in \{1,2,\ldots ,m\}$ :
-
• $\|\tilde f_i\|_{{\mathcal C}^3}<\infty $ ;
-
• $\|\tilde f_i^{-1}\|_{{\mathcal C}^1}<\infty $ ; and
-
• $\partial P_i$ is contained in the union of finitely many ${\mathcal C}^2 (d-1)$ -dimensional manifolds $\{M_j\}$ , $\partial M_j\cap \overline P_i=\emptyset $ . Such manifolds are pairwise transversal, and the intersection of any set of such manifolds consists of a finite collection of ${\mathcal C}^2$ manifolds.
For a piecewise smooth contraction f, we define the extended-metric
Note that $d_2(f,g)\geq \rho (f,g)$ . The following density result is proved in §4.
Theorem 2.17. Piecewise smooth Markov contractions are $d_2$ -dense in the space of piecewise smooth contractions.
Remark 2.18. Theorems 2.8, 2.11 and 2.15 show that, for a piecewise contraction, to be Markov means being stable under a rather weak topology ( $d_1$ ). Instead, Theorem 2.17 shows that to be Markov means being dense under a quite strong topology ( $d_2$ ). These theorems collectively show that, for a piecewise contraction, being Markov is generic; hence, to have a Cantor set as an attractor is rare.
As already mentioned, a result on density is present in the literature (see [Reference Catsigeras, Guiraud, Meyroneinc and Ugalde8]). However, it is proved under a much coarser metric as compared with $d_2$ . More importantly, it assumes that the maps have the separation property, which implies that they are globally injective, whereas we assume only piecewise injectivity.
3. Openness and topological stability
Recall that, for any piecewise contraction f, ${\boldsymbol P}(f)$ stands for the maximal partition, so $\partial {\boldsymbol P}(f)=\Delta (f)$ . In addition, for any $n>1$ , the elements of partition ${\boldsymbol P}(f^n)$ are given by equation (2.1), and $\#{\boldsymbol P}(f^n)=m_n$ , $m_1=m$ .
Proof of Theorem 2.8
Let f satisfy $\Lambda (f)\cap \Delta (f)=\emptyset $ . For all $n\in {\mathbb N}$ , $\overline {f^{n+1}(X)}\subset \overline {f^{n}(f(X))}\subset \overline {f^n(X)}$ implies that $\{\overline {f^n(X)}\}$ is a nested sequence of non-empty compact sets. Further, Cantor’s intersection theorem implies that $\Lambda (f)=\bigcap _{n\in {\mathbb N}} \overline {f^n(X)}\neq \emptyset $ and it is closed. Accordingly, there exists $\varepsilon>0$ such that $d_0(\Lambda (f),\Delta (f))>\varepsilon $ . We claim that, for any $\varepsilon>0$ , there exists $N_{\varepsilon }\in {\mathbb N}$ such that, for $n\geq N_{\varepsilon }$ , $\overline {f^n(X)}\subset B_{\varepsilon /2}(\Lambda (f))$ . (For $r>0$ and a set A, $ B_r(A)=\{y: \text { there exists } x\in A\ \mathrm {such\ that}\ d_0(x,y)<r $ }.) Indeed, if this was not the case, then there would exist a sequence $\{n_j\}$ , $n_j\to \infty $ , such that $\overline {f^{n_j}(X)}\cap B_{\varepsilon /2}(\Lambda (f))^c\neq \emptyset $ . It follows that, for each $n\in {\mathbb N}$ , there exists j such that $n_j\geq n$ , and hence
which, taking the intersection on n, yields a contradiction. Consequently, for every ${P\in {\boldsymbol P}(f^{N_\varepsilon })}$ , there exists $Q \in {\boldsymbol P}(f^{N_{\varepsilon }})$ such that $\overline {f^{N_\varepsilon }(P)}\subset Q$ ; otherwise, there would exist $x\in f^{N_\varepsilon }(P)\cap \partial {\boldsymbol P}(f^{N_{\varepsilon }})$ , that is, a $k\in {\mathbb N}$ such that
which is a contradiction.
Conversely, let f be a piecewise Markov contraction with stabilization time $N\in {\mathbb N}$ . By definition, for every $P\in {\boldsymbol P}(f^N)$ , there exists $Q\in {\boldsymbol P}(f^N)$ such that $\overline {f^N(P)}\subset Q$ : that is, $\overline {f^N(P)}\cap \partial {\boldsymbol P}(f^N)=\emptyset $ . Note that $\overline {f(X)}=\bigcup _{i=1}^m\overline {f(\tilde P_i)}\subset \bigcup _{i=1}^m \overline {f(P_i)} $ , where $\tilde P_i$ is as given in Definition 2.1. Thus,
implies that $\Lambda (f)\cap \partial {\boldsymbol P}(f)=\emptyset $ .
To prove the second part of the theorem, let $x\in \Lambda (f)$ and let N be the stabilization time. By definition, for each $k\in {\mathbb N}$ , there exists $y_k\in Q_{k}\in {\boldsymbol P}(f^N)$ such that $x=f^{kN}(y_k)$ . In addition, there exists $P\in {\boldsymbol P}(f^N)$ such that $x\in P$ . This implies that ${\overline {f^{kN}(Q_k)}}\subset P$ for all $k\in {\mathbb N}$ . Let $l:=\# {\boldsymbol P}(f^N)$ . (For a discrete set $M,\ \# M$ denotes the cardinality of M.) Then there exists $k_1\in \{0,\ldots , l\}$ such that $P=Q_{k_1}$ and hence $\overline {f^{k_1N}(P)}\subset P$ . By the contraction mapping theorem, $f^{k_1N}:\overline {P}\to \overline {P}$ has a unique fixed point, say, $z\in \overline {P}$ . Let $j\in {\mathbb N}$ be the smallest integer for which $f^j(z)=z$ . Then $x\in \bigcap _{n\in {\mathbb N}}\overline { f^{nj}(P)}=\{z\}$ . Hence, $\Lambda (f)$ consists of periodic orbits.
To prove the result on openness and topological stability, we first prove that the functions $\rho ,\ d_1$ , defined in §2, are, in fact, metrics on the set of piecewise contractions. Note that, by definition of $H({\boldsymbol P}(f),{\boldsymbol P}(g))$ , if $\# {\boldsymbol P}(f)\neq \# {\boldsymbol P}(g)$ , then $H({\boldsymbol P}(f),{\boldsymbol P}(g))=\emptyset $ .
Lemma 3.1. $\rho $ is a metric.
Proof. Let $f,g,h$ be piecewise contractions.
If $\rho (f,g)=0$ , then there exists a sequence $\psi _n\in H({\boldsymbol P}(f),{\boldsymbol P}(g))$ , $\|\psi _n-\mathrm {id}\|_{{\mathcal C}^0(X,X)} \to 0$ and $\|f-g\circ \psi _n\|_{{\mathcal C}^0(X,X)}\to 0$ as $n\to \infty $ , which implies that $\psi _n\to \mathrm {id}$ as $n\to \infty $ which further implies that ${\boldsymbol P}(f)\hspace{-1pt}=\hspace{-1pt}{\boldsymbol P}(g)$ . Also, $\psi _n\hspace{-1pt}\to\hspace{-1pt} \mathrm {id}$ means that $f\hspace{-1pt}-\hspace{-1pt}g\circ \psi _n\hspace{-1pt}\to\hspace{-1pt} f\hspace{-1pt}-\hspace{-1pt}g$ , and hence $f=g$ .
Next, we check the symmetry of $\rho $ . If $H({\boldsymbol P}(f),{\boldsymbol P}(g))\kern1.3pt{=}\kern1.3pt\emptyset $ , then ${\rho (f,g)\kern1.3pt{=}\kern1.3pt\rho (g,f)\kern1.3pt{=}\kern1.3pt A}$ . If $H({\boldsymbol P}(f),{\boldsymbol P}(g))\neq \emptyset $ , then
It remains to check the triangle inequality.
To show that $\rho (f,g)\leq \rho (f,h)+\rho (g,h)$ , consider the following cases.
If $H({\boldsymbol P}(f),{\boldsymbol P}(g))=\emptyset $ , then $\rho (f,g)=A$ , and $H({\boldsymbol P}(f),{\boldsymbol P}(h))=\emptyset $ or/and $H({\boldsymbol P}(g), {\boldsymbol P}(h))=\emptyset $ : that is, either one of the two or both are empty sets. Therefore, $\rho (f,h)=A$ or $\rho (g,h)=A$ and so $\rho (f,g)=A \leq \rho (f,h)+\rho (h,g)$ .
If $H({\boldsymbol P}(f),{\boldsymbol P}(g))\neq \emptyset $ , then there are the following two possibilities.
-
(1) If $H({\boldsymbol P}(f),{\boldsymbol P}(h))=\emptyset $ and $H({\boldsymbol P}(h), {\boldsymbol P}(g))=\emptyset $ , then $\rho (f,h)+\rho (g,h)\geq A$ and
$$ \begin{align*} \hspace{1.1cm}\rho(f,g)& =\inf\limits_{\psi\in H(P(f),P(g))} \{\|\psi-\mathrm{id}\|_{{\mathcal C}^0(X,X)}+\|f-g\circ\psi\|_{{\mathcal C}^0(X,X)}\}\\ &\leq2 \operatorname{diam}(X). \end{align*} $$Since $A\geq 2 \operatorname {diam}(X)$ , we have the result.
-
(2) $H({\boldsymbol P}(f),{\boldsymbol P}(g))\neq \emptyset $ , and $H({\boldsymbol P}(g),{\boldsymbol P}(h))\neq \emptyset $ .
Given $\phi \in H({\boldsymbol P}(g),{\boldsymbol P}(h))$ and $\varphi \in H({\boldsymbol P}(f),{\boldsymbol P}(h))$ , the homeomorphism ${\psi =\phi ^{-1}\circ \varphi \in H({\boldsymbol P}(f), {\boldsymbol P}(g))}$ , and hence
$$ \begin{align*} \rho(f,g)&=\inf\limits_{\psi\in H({\boldsymbol P}(f), {\boldsymbol P}(g))}\{\|\psi-\mathrm{id}\|_{{\mathcal C}^0(X,X)}+ \|f-g\circ\psi\|_{{\mathcal C}^0(X,X)}\}\\[2pt] &\leq \inf\limits_{\varphi\in H({\boldsymbol P}(f),{\boldsymbol P}(h))}\inf\limits_{\phi\in H({\boldsymbol P}(g),{\boldsymbol P}(h))} \{\|\phi^{-1}\circ\varphi-\mathrm{id}\|_{{\mathcal C}^0(X,X)}\\[2pt] &\quad +\|f-h\circ\varphi\|_{{\mathcal C}^0(X,X)} +\|h\circ\varphi-g\circ\phi^{-1}\circ\varphi\|_{{\mathcal C}^0(X,X)}\} \\[2pt] &\leq\! \inf\limits_{\varphi\in H({\boldsymbol P}(f),{\boldsymbol P}(h))}\inf\limits_{\phi\in H({\boldsymbol P}(g),{\boldsymbol P}(h))} \{\|\phi^{-1}-\mathrm{id}\|_{{\mathcal C}^0(X,X)}+\|f-h\circ\varphi\|_{{\mathcal C}^0(X,X)}\\[2pt] &\quad +\|\varphi-\mathrm{id}\|_{{\mathcal C}^0(X,X)}+\|h\circ\varphi-g\circ\phi^{-1}\circ\varphi\|_{{\mathcal C}^0(X,X)}\} \\[2pt] &=\inf\limits_{\phi\in H({\boldsymbol P}(g),{\boldsymbol P}(h))} \{\|\phi^{-1}-\mathrm{id}\|_{{\mathcal C}^0(X,X)}+\|h\circ\varphi-g\circ\phi^{-1}\circ\varphi\|_{{\mathcal C}^0(X,X)}\}\\[2pt] &\quad + \inf\limits_{\varphi\in H({\boldsymbol P}(f),{\boldsymbol P}(h))}\{\|\varphi-\mathrm{id}\|_{{\mathcal C}^0(X,X)}+\|f-h\circ\varphi\|_{{\mathcal C}^0(X,X)}\}\\[2pt] &=\rho(f,h)+\rho(g,h). \end{align*} $$
Hence, $\rho (\cdot ,\cdot )$ is a metric.
Lemma 3.1 implies that $d_1$ is also a metric since the series is convergent.
Proof of Theorem 2.11
Let $f $ be Markov. We want to prove that there exists a neighbourhood of f consisting of only Markov contractions. Since f is Markov, there exists $N\in {\mathbb N}$ such that the maximal partition ${\boldsymbol P}(f)$ of f stabilizes with stabilization time N. Let $g\neq f$ be a piecewise contraction such that $d_1(f,g)<\delta $ for $0<\delta < \sigma ^N A$ (recall that $A=\operatorname {diam}(X)$ ).
We show that the partition of g stabilizes with the same stabilization time N.
Note that, for all $n\leq N$ , $\rho (f^n,g^n)<A$ which implies that $H({\boldsymbol P}(f^n),{\boldsymbol P}(g^n))\neq \emptyset $ . Let $P\in {\boldsymbol P}(g^N)$ . Then there exist $\psi _N\in H({\boldsymbol P}(f^N),{\boldsymbol P}(g^N))$ and $P^\prime \in {\boldsymbol P}(f^N)$ such that $\psi _N(P^\prime )~=~P$ . By stabilization, for $P^{\prime }$ , there exists a unique $Q^\prime \in {\boldsymbol P}(f^N)$ such that $\overline {f^N(P^\prime )}\subset Q^\prime $ . In addition, $\psi _N(Q^\prime )=Q$ for some $Q\in {\boldsymbol P}(g^N)$ . Now, $\overline {f^N(P^\prime )}\subset Q^\prime $ and $Q^\prime $ being open implies that $\varepsilon =\min _{P^\prime \in {\boldsymbol P}(f^N)}d_0(\overline {f^N(P^\prime )}, \partial Q^\prime )>0$ . Choosing $0<\delta <\varepsilon \sigma ^N/3$ , we claim that $\overline {g^N(P)}\subset Q$ . Indeed, if there exists $x\in \overline {g^N(P)}\cap \partial Q$ , then there exists a sequence $\{x_k\}\in g^N(P)\cap Q$ such that $x_k\to x$ as $k \to \infty $ . Let $y_k\in P$ such that $g^N(y_k)=x_k$ . Note that $\psi _N^{-1}(x)\in \partial Q^\prime $ . Now,
and we get $\varepsilon <\varepsilon /3$ , which is a contradiction. Hence, if f is Markov with N as the stabilization time, then, for
all piecewise contractions g, with $d_1(f,g)<\delta $ , are Markov contractions and the stabilization time of g is also N. Thus, the collection of Markov contractions is open.
To prove Theorem 2.15, we first need to prove the following lemma which, in itself, brings some important information about the dynamics of a Markov contraction.
Lemma 3.2. Let f be a Markov contraction with maximal partition ${\boldsymbol P}(f)$ and stabilization time N. Then, for every $P\in {\boldsymbol P}(f^N)$ , there exists $Q\in {\boldsymbol P}(f^N)$ such that $f(P)\subset Q$ .
Proof. By Definition 2.6, there exists $P^\prime \in {\boldsymbol P}(f^N)$ such that $f^N(P)\subset P^\prime $ . By the definition given in equation (2.1), there exist partition elements $\{P_i\}_{i=0}^{N-1}$ (not necessarily distinct) in ${\boldsymbol P}(f)$ such that
Similarly, there exist partition elements $\{P_j^\prime \}_{j=1}^{N-1}$ in ${\boldsymbol P}(f)$ such that
Let $x\in P$ . Then $f^k(f(x))\in P_{k+1}$ for every $k\in \{0,1,\ldots ,N-2\}$ and $f^N(x)= f^{N-1}f(x)\in P^\prime $ , which implies that $f(x)\in f^{-(N-1)}(P^\prime _0)$ . Consequently,
Since $x\in P$ is arbitrary, $f(P)\subset Q\in {\boldsymbol P}(f^N)$ .
Finally, to prove Theorem 2.15, we restrict to piecewise injective contractions and prove the stability result under the metric $d_1$ . Recall that, for every piecewise injective contraction f, with maximal partition ${\boldsymbol P}(f)=\{P_1,\ldots ,P_m\}$ , there exists $U_i=\mathring U_i\supset \overline {P_i}$ and an injective continuous extension $\tilde {f}:U_i\to {\mathbb R}^d$ such that $\tilde f|_{P_i}=f|_{P_i}$ . In this proof, we always consider these extensions which, to alleviate notation, we still denote by f.
Our strategy for the following proof is as follows. For piecewise injective Markov contractions f and g with maximal partitions ${\boldsymbol P}(f)$ and $ {\boldsymbol P}(g)$ , respectively, and for some $\delta>0$ , $d_1(f,g)<\delta $ , we construct semi-conjugacies from $NW(f)$ , the non-wandering set for f, to $NW(g)$ , the non-wandering set for g, using a homeomorphism between the partitions given in the definition of the metric $\rho $ .
Proof of Theorem 2.15
Let $N\in {\mathbb N}$ be the stabilization time of ${\boldsymbol P}(f)$ . By Theorem 2.11, there exists $\delta>0$ such that, if $d_1(f,g)<\delta $ , then ${\boldsymbol P}(g)$ also stabilizes at time N. By Theorem 2.8, the attractors of f and g consist of eventually periodic orbits. Let $P\in {\boldsymbol P}(f^N)$ be a periodic element of the partition. Then there exists $n_0\in {\mathbb N}$ such that $\overline {f^{n_0}(P)}\subset P$ . Then $d_1(f,g)<\delta $ implies that there exists $Q\in {\boldsymbol P}(g^N)$ such that $\overline {g^{n_0}(Q)}\subset Q$ . By the contraction mapping theorem, for $f^{n_0}: \overline {P}\to \overline {P}, g^{n_0}: \overline {Q}\to \overline {Q}$ , there exist $x_f$ and $x_g$ , the unique fixed points of $f^{n_0}$ and $g^{n_0}$ , respectively, in P and Q.
Using Lemma 3.2 inductively, let $P_i,\ Q_i$ be the partition elements in ${\boldsymbol P}(f^N), {\boldsymbol P}(g^N)$ , respectively, such that $ f^i(\overline P)\subset P_i $ , $ g^i(\overline Q)\subset Q_i$ . Let $\widehat {P}= \bigcup _{i=1}^{n_0} {\overline P_i}$ , $ \widehat {Q}=\bigcup _{i=1}^{n_0}{\overline Q_i}$ . Then, for each $i\in {\mathbb N}$ , $\widehat {P}_i=f^i(\widehat {P})\subset \widehat {P}$ , $\widehat {Q}_i=g^i(\widehat {Q})\subset \widehat {Q} $ .
Note that, for $\delta $ small enough, we have $ H({\boldsymbol P}(f^n),{\boldsymbol P}(g^n))\neq \emptyset $ for every $n\leq n_0$ : that is, there exists $\psi \in H({\boldsymbol P}(f^{n_0}),{\boldsymbol P}(g^{n_0}))$ such that $\|\psi -\mathrm {id}\|_{{\mathcal C}^0(X,X)}<\delta $ and $\|f^n-g^n\circ \psi \|_{{\mathcal C}^0(X,X)}<\delta $ , and thus $\psi (\widehat {P})=\widehat {Q}$ .
Next, define $\widehat g=\psi \circ f\circ \psi ^{-1}$ . Since, by Definition 2.10, $f^{-1}$ is well defined on $\widehat P$ , we have that $\widehat g$ is invertible on $\widehat Q$ and $\widehat g^{-1}(\partial \widehat Q)=\psi \circ f^{-1}( \partial \widehat P)$ .
Next, for $\varepsilon>0$ small enough, let $\widehat P_{1, \varepsilon }$ be the $\varepsilon $ -neighbourhood of $f(\widehat P)$ and let $\widehat P_{c,\varepsilon }$ be the $\varepsilon $ -neighbourhood of ${\widehat P}^\complement $ , the complement of $\widehat P$ . Similarly, let $\widehat Q_{c,\varepsilon }=\psi \circ f^{-1}(\widehat P_{c,\varepsilon })$ and $\widehat Q_{1,\varepsilon }=\psi \circ f^{-1}(\widehat P_{1,\varepsilon })$ . By the Markov property, we have $\overline {\widehat P}_{1,\varepsilon }\cap \overline {\widehat P}_{c,\varepsilon }=\emptyset $ , and thus $\overline {\widehat Q}_{1,\varepsilon }\cap \overline {\widehat Q}_{c,\varepsilon }=\emptyset $ . Hence, by Urysohn’s lemma, there exists a function $\theta \in {\mathcal C}^0(X,[0,1])$ such that $\theta |_{\overline {\widehat Q}_{1, \varepsilon }}=1$ and $\theta |_{\overline {\widehat Q}_{c,\varepsilon }}=0$ . Finally, define the continuous functions
Lemma 3.3. Provided $\delta>0$ is small enough, we have $h_0(\widehat P\setminus \widehat P_1)=\widehat Q\setminus \widehat Q_1$ , $h_0(\partial \widehat P)=\partial \widehat Q$ , $h_0(\partial \widehat P_1)=\partial \widehat Q_1$ .
Proof. First, using the properties of the homeomorphism $\psi $ , we have, for each $x\kern1.2pt{\in}\kern1.2pt \overline {\widehat P\setminus \widehat P_1}$ ,
If $x\in \widehat P_{c,\varepsilon }\cap \widehat P$ , then $\psi \circ f^{-1}(x)\in \widehat Q_{c,\varepsilon }$ , and thus $h_0(x)=\widehat g\circ \psi \circ f^{-1}(x)=\psi (x)$ . Whereas, if $x\in \widehat P_{1,\varepsilon }$ , then $\psi \circ f^{-1}(x)\in \widehat Q_{1,\varepsilon }$ , and thus $h_0(x)=g\circ \psi \circ f^{-1}(x)$ . In addition,
Also, if $\delta $ is small enough, then $h_0$ is invertible on $(\widehat P_{c,\varepsilon }\cup \widehat P_{1, \varepsilon })\cap \widehat P$ . Thus, to prove surjectivity, it suffices to prove that each $p\in (\widehat Q\setminus \widehat Q_1)\setminus h_0((\widehat P_{c,\varepsilon }\cup \widehat P_{1, \varepsilon })\cap \widehat P)$ belongs to $h_0(\widehat P\setminus \widehat P_1)$ . Let $B=\{z\in {\mathbb R}^{n}\;:\; \|z\|\leq 3\delta \}$ , $x=p+z$ and $h_0(x-p)=x-h_0(x)$ . Then $h_0(x)=p$ is equivalent to
Since equation (3.1) implies that $h_0(B)\subset B$ , by Brouwer’s fixed-point theorem it follows that there exists at least one $z\in B$ such that $h_0(z+p)=p$ and, for $\delta \leq \varepsilon /6$ , $z+p \in \widehat P\setminus \widehat P_1$ .
For the sake of convenience, let $\widehat P_0=\widehat P$ and $\widehat Q_0=\widehat Q$ . For every $i\in {\mathbb N}$ , we define ${h_{i}:\overline {\widehat {P}_i\backslash \widehat {P}_{i+1}}\to \overline {\widehat {Q}_i\backslash \widehat {Q}_{i+1}}}$ as $h_{i}(x)=g\circ h_{i-1}\circ f^{-1}(x)$ . Thus, we define the semi-conjugacy $H:\overline {\widehat {P}}\to \overline {\widehat {Q}}$ as
H is continuous because, for $x\in \partial \widehat {P}_i, H(x)=h_{i+1}(x)=g\circ h_i\circ f^{-1}(x)=h_i(x)$ and, for any sequence $(x_i)\in \widehat {P}_i\backslash \widehat {P}_{i+1}$ with $x_i\to x_f$ as $i\to \infty $ , $(Hx_i)\in \widehat {Q}_i\backslash \widehat {Q}_{i+1}$ , so $H(x_i) \to x_g=H(x_f)$ . Indeed, H is surjective, for $p\in \overline {\widehat {Q}}$ , and there exists $i\in {\mathbb N}\cup \{0\}$ such that $p\in \overline {\widehat Q_i\setminus \widehat Q_{i+1}}$ . We use induction on i. If $i=0$ , then, using Lemma 3.3, there exists $z\in \overline {\widehat Q_i\setminus \widehat Q_{i+1}}$ such that $h_0(z)=p$ . Instead, if $i\neq 0$ , then $p_1=g^{-1}(p)\in \overline {\widehat Q_{i-1}\setminus \widehat Q_{i}}$ . Inductively, as $h_{i-1}$ is surjective, there exists $q_1\in \overline {\widehat P_{i-1}\setminus \widehat P_{i}}$ such that $h_{i-1}(q_1)=p_1$ . Now, by definition, $q=f(q_1)\in \overline {\widehat P_{i}\setminus \widehat P_{i+1}}$ , and finally ${g\circ h_i\circ f^{-1}(q)=p}$ .
Furthermore, H is the wanted semi-conjugacy between $f,g$ , on $\widehat {P}$ and $\widehat {Q}$ because, for $x\in \widehat {P}$ ,
To obtain a semi-conjugacy on the whole of $NW(f)$ , we repeat the same steps for every periodic partition element $P\in {\boldsymbol P}(f^N), Q\kern1.2pt{\in}\kern1.2pt {\boldsymbol P}(g^N)$ with the same ${\psi \kern1.2pt{\in}\kern1.2pt H({\boldsymbol P}(f^N),{\boldsymbol P}(g^N))}$ . Calling $\{\widehat {P}^k\}$ the collection of the union of elements associated to a periodic orbit, we have $\bigcup _k \widehat {P}^k=NW(f)$ . Pasting these functions together, we obtain a function $\Theta :{NW}(f)\to {NW}(g)$ with $\Theta (x)=H_{\widehat {P}^k}(x)$ for $ x\in \widehat {P}^k\subset NW(f)$ . Then $\Theta $ is continuous on $NW(f)$ as $\Theta |_{\partial \widehat {P}^k}=\psi |_{\partial \widehat {P}^k}$ for every k.
To construct the semi-conjugacy from the other side, we use the fact that $\psi $ is a homeomorphism and repeat the same construction using $\psi ^{-1}$ instead of $\psi $ and switching the roles of f and g.
4. Density
We prove Theorem 2.17 in two steps. First, we show that Markov maps are dense in a special class of systems (piecewise strongly contracting) and then we will show that such a class is itself dense in the collection of piecewise smooth contractions.
Definition 4.1. (Piecewise strongly contracting)
A piecewise smooth contraction f with contraction coefficient $\unicode{x3bb} $ and maximal partition ${\boldsymbol P}(f)=\{P_1,P_2,\ldots ,P_m\}$ is said to be piecewise strongly contracting if there exists $p\in {\mathbb N}$ such that $\unicode{x3bb} ^pm_p<1/2$ , where ${m_p=\#{\boldsymbol P}(f^p)}$ .
We prove the following results.
Proposition 4.2. Markov maps are $d_2$ -dense in the collection of piecewise strong contractions.
Proposition 4.3. Piecewise strong contractions are $d_2$ -dense in the collection of piecewise smooth contractions.
These two propositions readily imply our main result.
Proof of Theorem 2.17
Let f be a piecewise smooth contraction. By Proposition 4.3, for each $\varepsilon>0$ , there exists a piecewise strong contraction $f_1$ such that $ d_2(f,f_1)<\varepsilon /2$ . In addition, by Proposition 4.2, there exists a piecewise smooth Markov contraction $f_2$ such that $ d_2(f_1,f_2)<\varepsilon /2$ , and hence the result.
In the rest of the paper, we prove Propositions 4.2 and 4.3.
The basic idea of the proof is to introduce iterated function systems (IFSs) associated with the map. The attractor of the IFS is greater than the one of the map (see §4.1 for the relationship between the two sets), and hence if we can prove that the attraction of the IFS is disjoint from the discontinuities of the map, so will be the attractor of the map. The advantage is that, in this way, the study of the boundaries of the elements of ${\boldsymbol P}(f^n)$ is reduced to the study of the pre-images of the discontinuities of f under the IFS. Hence, we can iterate smooth maps rather than discontinuous ones.
This advantage is first exploited in §4.2, where we prove Proposition 4.2 using an argument that is, essentially, a quantitative version of Sard’s theorem.
To prove Proposition 4.3, the rough idea is to use a transversality theorem (see Appendix B) to show that if a lot of pre-images intersect, then, generically, their intersection should have smaller and smaller dimensions until no further intersection is generically possible. Unfortunately, if we apply a transversality theorem to a composition of maps of the IFS, we get a perturbation of the composition and not of the single maps. How to perturb the single maps in such a way that the composition has the wanted properties is not obvious.
Our solution to this problem is to make sure that if we perturb the maps in a small neighbourhood B, and we consider arbitrary compositions of the perturbed maps, then all the images of B along the composition never intersect B. Hence, if we restrict the composition to B, all the maps, except the first, will behave as their unperturbed version. To ensure this, it suffices to prove that such compositions have no fixed points near the singularity manifolds (such an implication is proved in Lemma 4.10). To this end, in Propositions 4.11 and 4.17, we show that one can control the location of the fixed points of the compositions of the map of the IFS by an arbitrarily small perturbation.
After this, we can finally set up an inductive scheme to ensure that the pre-images of the discontinuity manifolds keep intersecting transversally. This is the content of Proposition 4.19 from which Proposition 4.3 readily follows.
4.1. IFSs associated to the map and their properties
We start by recalling the definition of an IFS relevant to our argument and exploring some of its properties.
Definition 4.4. (IFS)
The set $\Phi =\{\phi _1, \phi _2, \ldots ,\phi _m\}, m\geq 2$ , is an IFS if each map ${\phi _i:{\mathbb R}^d\to {\mathbb R}^d}$ is a Lipschitz contraction. (In the following, we will consider only ${\mathcal C}^3$ maps $\phi _i$ .)
Let f be a piecewise smooth contraction with maximal partition ${\boldsymbol P}(f)$ . By analogy with [Reference Nogueira, Pires and Rosales23], we define an IFS associated to f as follows.
By Definition 2.16, $\tilde f|_{U_i}$ is ${\mathcal C}^3$ for every i and $D\tilde f|_{U_i}\leq \unicode{x3bb} <1$ . Using the ${\mathcal C}^r$ version of Kirszbraun–Valentine theorem A.2, we obtain a ${\mathcal C}^3$ extension $\phi _i:{\mathbb R}^d\to {\mathbb R}^d$ of $\tilde f|_{U_i}$ , and hence of $f|_{P_i}$ , so that $\|D\phi _i\|\leq \unicode{x3bb} <1$ for all $i\in \{1,2,\ldots ,m\}$ . We denote a ${\mathcal C}^3$ IFS associated to f by
Remark 4.5. Unfortunately, it is not obvious if one can obtain an extension in which $\phi _i$ are invertible. This would simplify the following arguments as one would not have to struggle to restrict the discussion to the sets $U_i$ (e.g., see (4.11)). However, since ${\mathcal C}^\infty ({\mathbb R}^d,{\mathbb R}^d)$ finite to one maps are generic by Tougeron’s theorem (see [[Reference Golubitsky and Guillemin15], Theorem 2.6, pp. 169]), we can assume, by an arbitrarily small perturbation of f, that the $\phi _i$ are finite to one.
For $m,n\in {\mathbb N}$ , let $\Sigma ^m_n=\{1,2,\ldots ,m\}^n$ and $\Sigma ^m=\{1,2,\ldots ,m\}^{{\mathbb N}}$ be the standard symbolic spaces. We endow $\Sigma ^m$ with the metric $d_\gamma $ for some $\gamma>1$ :
In addition, let $\tau :\Sigma ^m\to \Sigma ^m$ be the left subshift: $\tau (\sigma _1,\sigma _2,\sigma _3, \ldots )= (\sigma _2,\sigma _3,\ldots )$ .
Set $K\kern1.3pt{=}\kern1.3pt\max \{\|x\|\;:\;x\in X\}$ ( $\|\cdot \|$ is the general Euclidean norm). Let $M\kern1.3pt{=}\kern1.3pt\sup _i\|\phi _i(0)\|$ . Then, for each $y\in {\mathbb R}^d$ ,
Thus, setting
we have $\phi _i(Y)\subset Y$ , for all $i\in \{1,\ldots , m\}$ , $X\subset Y $ and Y is a d-dimensional manifold with boundary.
Next, define $\Theta _{\! f}:\Sigma ^m\to Y$ as
where $\sigma =(\sigma _1, \sigma _2,\ldots ,)\in \Sigma ^m$ . The sets $\{\phi _{\sigma _1}\circ \phi _{\sigma _2}\circ \cdots \phi _{\sigma _n}(Y)\}_{n\in {\mathbb N}}$ form a nested sequence of compact subsets of Y. In addition, $\operatorname {diam}({\phi _{\sigma _1}\circ \phi _{\sigma _2}\circ \cdots \circ \phi _{\sigma _n}(Y)})\to 0$ as $n\to \infty $ , so, by Cantor’s intersection theorem, $\Theta _{\! f}(\sigma )$ is a single element in Y for every $\sigma \in \Sigma $ , which implies that $\Theta _{\! f}$ is well defined. We define the attractor of the IFS $\Phi _{\! f}$ as
Lemma 4.6. The function $\Theta _{\! f}:\Sigma ^m\to Y$ is continuous. In turn, $\Lambda (\Phi _{\! f})$ is compact.
Proof. For given $\varepsilon>0$ , there exists $k\in {\mathbb N}$ such that $\unicode{x3bb} ^k\operatorname {diam}(Y)<\varepsilon $ . Let $\delta =\gamma ^{-k}$ , where $\gamma $ is the one in (4.1). For $\sigma ,\sigma ^\prime \in \Sigma ^m$ , $d_{\gamma }(\sigma , \sigma ^{\prime })<\delta =\gamma ^{-k}$ implies that, for all $i<k$ , $\sigma _i\!=\sigma ^{\prime }_i$ , and hence, for any $x,y\in Y$ ,
and hence the continuity with respect to $\sigma $ . Since $\Sigma ^m$ is compact, the attractor $\Lambda (\Phi _{\! f})$ is compact as it is the continuous image of a compact set.
Remark 4.7. Let $\Phi _{\! f}=\{\phi _1,\ldots ,\phi _m\}$ be an IFS associated to a piecewise contraction f. For $p\in {\mathbb N}$ , let ${\boldsymbol P}(f^p)=\{P^p_1,P^p_2,\ldots , P^p_{m_p}\}$ be the partition of $f^p$ given as in equation (2.1). Define the corresponding IFS associated to $f^p$ as
where, for all $i\in \{1,2,\ldots ,m_p\}$ , there exists unique $\sigma ^i=(\sigma ^i_1,\ldots ,\sigma ^i_p)\in \Sigma _p^m$ (uniquely determined by the partition element $P_i^p\in {\boldsymbol P}(f^p)$ ) such that
The attractor of $\Phi _{f^p}$ is $\Lambda (\Phi _{f^p})= \Theta _{\! f^p}(\Sigma ^{m_p})$ . To avoid confusion, we denote the elements in $\Sigma ^m$ by $\sigma $ and the elements in $\Sigma ^{m_p}$ by $\omega $ .
Lemma 4.8. For a piecewise smooth contraction f with IFS $\Phi _{\! f}$ and for $p\in {\mathbb N}$ , the following relationship holds.
Proof. For $p\in {\mathbb N}$ , let ${\boldsymbol P}(f^p)=\{P_1^p,P_2^p,\ldots ,P_{m_p}^p\}$ and let $m_1=m$ . We start by proving the first inclusion, that is $\Lambda (f)\subset \Lambda (f^p)$ . Let $x\in \Lambda (f)= \bigcap _{n\in {\mathbb N}}\overline {{f}^n(X)}$ , that is, for every $n\in {\mathbb N}$ , $x\in \overline {f^n(X)}$ . Accordingly,
Thus, $\Lambda (f)\subset \Lambda (f^p)$ . For the third inclusion, that is, $\Lambda (\Phi _{f^p})\subset \Lambda (\Phi _{\! f})$ , let $\Phi _{f^p}=\{\varphi _1,\varphi _2,\ldots ,\varphi _{m_p}\}$ . Then, for every $\omega =(\omega _1,\omega _2,\ldots )\in \Sigma ^{m_p}$ there exists $\sigma ^{\omega }=(\sigma ^{\omega _1},\sigma ^{\omega _2}, \ldots )$ , where $\sigma ^{\omega _i}=(\sigma ^{\omega _i}_1,\ldots ,\sigma ^{\omega _i}_p)\in \Sigma ^{m}_p$ , such that
Thus, $\Lambda (\Phi _{f^p})\!=\! \bigcup _{\omega \in \Sigma ^{m_p}}\Theta _{\! f^p}(\omega )\!=\!\bigcup _{\{\sigma ^\omega \;:\;\omega \in \Sigma ^{m_p}\}}\Theta _{\! f}(\sigma ^\omega )\subset \bigcup _{\sigma \in \Sigma ^{m}}\Theta _{\! f}(\sigma )=\Lambda (\Phi _{\! f}) $ . Finally, for the second inclusion, let $x\in \Lambda (f^p)=\bigcap _{n\in {\mathbb N}}\overline {f^{pn}(X)}$ . Then, for all $n\in {\mathbb N}$ , there exists $y_n\in X$ such that $d_0(x,f^{pn}(y_n))<1/n$ . By definition of $\Phi _{f^p}$ , there exists $\omega ^n=(\omega _1^n,\omega _2^n,\ldots ,\omega _n^n,\ldots )\in \Sigma ^{m_p}$ such that $f^{pn}(y_n)=\varphi _{\omega _1^n}\circ \cdots \circ \varphi _{\omega _n^n}(y_n)$ , where $\varphi _{\omega _i^n}\in \Phi _{f^p}$ . This implies that, for all $n\in {\mathbb N}$ , $d_0(x,\varphi _{\omega _1^n}\circ \cdots \circ \varphi _{\omega _n^n}(y_n))<1/n$ . By compactness of $\Sigma ^{m_p}$ , there exists a subsequence $\{n_k\}$ and $\omega \in \Sigma ^{m_p}$ such that $\omega ^{n_k}\to \omega $ . Since, by Lemma 4.6, $\Theta _{\! f^p}$ is continuous,
Hence, by definition of the attractor, $x\in \Lambda (\Phi _{f^p})$ .
4.2. A simple perturbation and the proof of Proposition 4.2
For $\delta \in {\mathbb R}^d$ with ${|\delta |>0}$ sufficiently small, and a piecewise contraction f with IFS $\Phi _{\! f}=\{\phi _1,\phi _2,\ldots ,\phi _m\}$ , we define perturbations $f^{\delta }, \Phi _{f^\delta }$ as
Provided $|\delta |$ is small enough, the perturbation $f^\delta $ satisfies $\overline {f^\delta (X)}\subset \mathring X$ , and hence $f^\delta $ is a piecewise smooth contraction with corresponding IFS $\Phi _{f^\delta }=\{\phi ^\delta _1,\phi ^\delta _2,\ldots ,\phi ^\delta _m\}$ . One can easily check that $d_2(f,f^\delta )=|\delta |$ . By definition (see (4.3)), $\Theta _{\! f^\delta }:\Sigma ^m\to Y$ reads
and the respective attractor is ${\Lambda }(\Phi _{f^\delta })= \bigcup _{\sigma \in \Sigma ^m}\Theta _{\! f^{\delta }}(\sigma )$ .
Observe that, for any $p\in {\mathbb N}$ , the corresponding IFS associated to $(f^{\delta })^p$ is given by $\Phi _{(f^\delta )^p}=\{\varphi _1^\delta , \varphi _2^\delta , \ldots , \varphi _{m_p}^\delta \}$ , where, for every $i\in \{1,2,\ldots ,m_p\}$ , there exists $\sigma _i\in \Sigma ^m_p$ such that $\varphi _i^\delta = \phi _{\sigma _1^i}^\delta \circ \phi _{\sigma _2^i}^\delta \circ \cdots \circ \phi _{\sigma _p^i}^\delta $ .
Lemma 4.9. The map $\Theta _{\! f^{\delta }}(\sigma )\mapsto \bigcap _{n \in {\mathbb N}}{\phi _{\sigma _1}^{\delta }\circ \phi _{\sigma _2}^{\delta }\circ \cdots \circ \phi _{\sigma _n}^{\delta }(Y)}$ is uniformly Lipschitz continuous in $\delta $ : that is, there exists $a>0$ such that, for all $\sigma \in \Sigma ^m$ , $d_0(\Theta _{\! f^{\delta }}(\sigma ), \Theta _{\! f^{\delta ^{\prime }}}(\sigma )) \leq a d_0(\delta ,\delta ^{\prime })$ .
Proof. Let $\delta ,\delta ^\prime>0$ , $n\in {\mathbb N}$ , $x\in Y$ and $\sigma =(\sigma _1,\sigma _2,\ldots ,\sigma _n,\ldots )$ . Then
Iterating the above argument yields
and letting $a=1/(1-\unicode{x3bb} )$ concludes the proof.
Proof of Proposition 4.2
Let $p\in {\mathbb N}$ be such that $m_p\unicode{x3bb} ^p\leq \tfrac 12$ . Lemma 4.8 asserts that $\Lambda (f^\delta )\subset \Lambda (\Phi _{(f^\delta )^p})$ . Hence, by Theorem 2.8, it suffices to prove that, for every $\varepsilon>0$ small enough, there exists $\delta \in B_{\varepsilon }(0)$ such that the attractor $\Lambda (\Phi _{(f^\delta )^p})$ is disjoint from $\partial {\boldsymbol P}(f^\delta )=\partial {\boldsymbol P}(f)$ .
Suppose, to the contrary, that, for every $\delta \in B_{\varepsilon }(0)$ , $\Lambda (\Phi _{(f^\delta )^p})\cap \partial {\boldsymbol P}(f)\neq \emptyset $ . Accordingly, there exists $\omega (\delta )\in \Sigma ^{m_p}$ for which $\Theta _{\! (f^\delta )^{p}}(\omega (\delta ))\in \partial {\boldsymbol P}(f)$ . By definition, $\partial {\boldsymbol P}(f)=\bigcup _{P\in {\boldsymbol P}(f)}\partial P$ . Therefore, there exists $P_i\in {\boldsymbol P}(f)$ and $A\subset B_{\varepsilon }(0)$ with $\mu _d(A)\geq \mu _d(B_{\varepsilon }(0))/m = C_d\varepsilon ^d/m$ ( $\mu _d$ is the d-dimensional Lebesgue measure) such that, for all $\delta \in A,\ \Theta _{\! (f^\delta )^p}(\omega (\delta ))\in \partial P_i $ .
Moreover, for each $k\in {\mathbb N}$ , there exist $\omega ^*= (\omega ^*_1, \omega ^*_2,\ldots ,\omega ^*_k)\in \Sigma ^{m_p}_k$ such that the set defined as
is non-empty and $\mu _d(A_k(\omega ^*))\geq \mu _d(A)/m_p^k\geq C_d\varepsilon ^dm^{-1}{m_p}^{-k}$ . Accordingly, for $\omega (\delta )\in A_k(\omega ^*)$ and the IFS associated to $\Phi _{(f^\delta )^p}=\{\varphi _1^\delta ,\varphi _2^\delta ,\ldots ,\varphi _{m_p}^\delta \}$ ,
where $\tau $ is the left shift as defined above and $\varphi _{\omega _j^*}^\delta = \phi ^\delta _{\sigma ^p_{j,1}}\circ \phi _{\sigma ^p_{j,2}}^\delta \circ \cdots \circ \phi _{\sigma ^p_{j,p}}^\delta $ , for $\omega ^*_j=(\sigma ^p_{j,1},\sigma ^p_{j,2},\ldots ,\sigma ^p_{j,p})\in \Sigma ^m_p$ with $\phi ^\delta _{\sigma ^p_{j,s}}\in \Phi _{f^\delta }$ .
Next, for some $\bar {x}\in X$ , define $\theta :A_k(\omega ^*)\to X$ as $\theta (\delta )=\varphi _{\omega ^*_1}^{\delta }\circ \varphi _{\omega ^*_2}^{\delta }\circ \cdots \circ \varphi _{\omega ^*_k}^{\delta }(\bar {x})$ . Then
Thus, $\theta (\delta )$ belongs to a $B_*\unicode{x3bb} ^{pk}$ neighbourhood of $\partial P_i$ . Since $\partial P_i$ is contained in the union of finitely many ${\mathcal C}^2$ manifolds, the Lebesgue measure of a $B_*\unicode{x3bb} ^{pk}$ neighbourhood of $\partial P_i$ is bounded above by $C\unicode{x3bb} ^{pk} \mu _{d-1}(\partial P_i)$ for a fixed constant $C>0$ . Accordingly,
On the other hand,
where, by definition of $\theta (\delta )$ , . Note that
where $\|\cdot \|$ is the standard operator norm defined as $\|L\|=\sup _{\|v\|=1}\|Lv\|$ for any linear operator $L:{\mathbb R}^d\to {\mathbb R}^d$ . It follows that, for all $v\in {\mathbb R}^d$ ,
since $\unicode{x3bb} ^p\leq {1}/{2m^p}$ and $m^p\geq 2$ . Hence, the eigenvalues of $D\theta (\delta )$ are larger, in modulus, than $\tfrac 12$ . Accordingly, $|\kern-2.5pt \det (D\theta (\delta ))|\geq 2^{-d}$ and
which, for k large enough, is in contradiction with (4.6), and this concludes the proof.
4.3. Fixed points in a generic position
Fix $N\in {\mathbb N}$ . For all $q\leq N+1$ and ${\sigma =(\sigma _1,\ldots ,\sigma _q)\in \Sigma ^m_{q}(\Phi )}$ , let $x_\sigma (\Phi )$ be the unique fixed point of $\phi _{\sigma _1}\circ \cdots \circ \phi _{\sigma _q}$ : that is,
The goal of this section is to define a perturbation that puts the above fixed points in a generic position. We start with the following trivial but useful fact concerning the location of such fixed points.
Lemma 4.10. Given an IFS $\Phi $ , if for some $y\in {\mathbb R}^d, \delta>0$ , $p\in {\mathbb N}$ and $\sigma \in \Sigma ^m_p$ ,
then the unique fixed point of $\phi _{\sigma _1}\circ \cdots \circ \phi _{\sigma _p}$ belongs to $ B_{c_*\delta }(y)$ , $c_*= {2}/({1-\unicode{x3bb} })>2$ .
Proof. The fact that $\|\phi _{\sigma _1}\circ \cdots \circ \phi _{\sigma _p}(y)-y\|\leq 2\delta $ implies that, for each $x\in B_{c_*\delta }(y)$ ,
Hence, $\phi _{\sigma _1}\circ \cdots \circ \phi _{\sigma _p}(B_{c_*\delta }(y))\subset B_{c_*\delta }(y)$ . The lemma follows by the contraction mapping theorem.
Let ${\mathcal A}_p=\{\phi _{\sigma _{1}}\circ \cdots \circ \phi _{\sigma _p}\;:\;\sigma \in \Sigma ^{m}_{p}\}$ and $A_p(\Phi ):=\{x_\sigma (\Phi )\;:\; \sigma \in \Sigma ^m_{p}(\Phi )\}$ and let $\#A_{N}(\Phi )=k_\star $ . We can now explain what we mean by having the fixed points in a generic position.
Proposition 4.11. For each $\varepsilon>0$ , there exists an $\varepsilon $ -perturbation $\Phi ^0=\{\phi ^0_k\}$ of $\Phi $ such that, for all $q\leq p\leq N$ , $\sigma \in \Sigma ^m_{q}(\Phi )$ , $\omega \in \Sigma ^m_{p}(\Phi )$ , $\omega \neq \sigma $ , if $\sigma _{p+1}\neq \omega _q$ , then $x_{\sigma }(\Phi ^0)\neq x_{\omega }(\Phi ^0)$ , whereas if $\sigma _{p+1}=\omega _q$ and $x_{\sigma }(\Phi ^0)= x_{\omega }(\Phi ^0)$ , then $\phi _{\sigma _{1}}\circ \cdots \circ \phi _{\sigma _q}$ and $\phi _{\omega _{1}}\circ \cdots \circ \phi _{\omega _p}$ are both some power of a $\Theta \in \bigcup _{s=1}^p {\mathcal A}_s$ .
Proof. We proceed by induction on p. If $\sigma _1,\omega _1\in \{1,\ldots ,m\}$ and $x_{\sigma _1}(\Phi )=x_{\omega _1}(\Phi )$ , then we can simply make the perturbation $\tilde \phi _{\sigma _1}(x)=\phi _{\sigma _1}(x)+\eta $ for some $\|\eta \|<\varepsilon /2$ . This proves the statement for $p=1$ . To simplify the notation, we keep calling $\Phi $ also the perturbed IFS. We suppose that the statement is true for p, after a perturbation of size at most $(1-2^{-p})\varepsilon $ , and we prove it for $p+1$ .
Since $A(p+1)=\{x_\sigma (\Phi )\;:\; \sigma \in \Sigma ^m_{p+1,*}(\Phi )\}$ is a finite discrete set,
Let $\delta \leq c_*^{-2}\delta ^*_{p+1}/2$ (recall that $c_*= {2}/({1-\unicode{x3bb} })$ ). Suppose that $z:=x_\sigma (\Phi )=x_\omega (\Phi )$ and $\omega _{q}=\sigma _{p+1}$ , where $\sigma \in \Sigma ^m_{p+1}(\Phi )$ , $\omega \in \Sigma ^m_{q}(\Phi )$ , $q\leq p+1$ and $\sigma \neq \omega $ . Let ${j\in \{0,\ldots , q-1\}}$ be the largest integer such that $\sigma _{p+1-j}=\omega _{q-j}$ . If $j=q-1$ , then it must be that $q\leq p$ ; otherwise, we would have $\omega =\sigma $ . Hence,
That is, $\phi _{\sigma _1}\circ \cdots \circ \phi _{\sigma _{p+1-q}}(z)=z$ . It follows, by the inductive hypothesis, that there exist $k\in {\mathbb N}$ and $\Theta \in {\mathcal A}_s$ , $s\leq q$ , such that
so $\phi _{\sigma _1}\circ \cdots \circ \phi _{\sigma _{p+1}}=\Theta ^{k+j}$ , as claimed. It remains to consider the case $j<q-1$ . Let $\psi := \phi _{\omega _{q-j}}\circ \cdots \circ \phi _{\omega _q}$ . Then $\psi =\phi _{\sigma _{p+1-j}}\circ \cdots \circ \phi _{\sigma _{p+1}}$ and
where, by construction, $\omega _{q-j-1}\neq \sigma _{p-j}$ . By renaming the indices, we are thus reduced to the case $\omega _{q}\neq \sigma _{p+1}$ . The following lemma is useful for analysing this case.
Sub-lemma 4.12. If, for $j\in \{1,\ldots , p\}$ ,
then $\phi _{\sigma _{j+1}}\circ \cdots \circ \phi _{\sigma _{p+1}}(z)=z$ and $\sigma _j=\sigma _{p+1}$ , and the same for $\omega $ .
Proof. Lemma 4.10 implies that there exists $z_1\in B_{c_*\delta }(z)$ such that $\phi _{\sigma _{j+1}}\circ \cdots \circ \phi _{\sigma _{p+1}}(z_1)=z_1$ . In addition,
Thus, $\phi _{\sigma _{1}}\circ \cdots \circ \phi _{\sigma _{j}}(B_{c_*\delta }(z))\cap B_{c_*\delta }(z)\neq \emptyset $ , and hence Lemma 4.10 implies that there exists $z_2\in B_{c_*^2\delta }(z)$ such that $\phi _{\sigma _{1}}\circ \cdots \circ \phi _{\sigma _{j}}(z_2)=z_2$ . The definition of $\delta $ implies that $z_1=z_2$ , which, in turn, implies that $z_1=z$ . Hence,
and, by the inductive hypothesis, this is possible only if $\sigma _{j}=\sigma _{p+1}$ . The argument for $\omega $ is identical.
If there exists $j\in \{1,\ldots , p\}$ and $k\in \{1,\ldots , q\}$ such that
then, by Sub-Lemma 4.12, $\phi _{\sigma _{j+1}}\circ \cdots \circ \phi _{\sigma _{p+1}}(z)=z=\phi _{\omega _{k+1}}\circ \cdots \circ \phi _{\omega _{q}}(z)$ , which contradicts our inductive hypothesis. Thus, if the first inequality is satisfied for some j, the second cannot be satisfied for any k, and vice versa. Suppose that there does not exist k for which the second inequality of (4.9) is satisfied (the other possibility being completely analogous).
Define the perturbation
where, by analogy with (4.18), for $v\in {\mathbb R}^d$ , $\|v\|=1$ ,
Note that $\|h_{z,\delta }-\mathrm {id}\|_{{\mathcal C}^2}\leq 2^{-p-1}\varepsilon _0$ . Since $h_{z,\delta }(B_{c_*^{-1}\delta }(z))\subset B_{c_*^{-1}\delta }(z)$ , it follows that the effect of the perturbation is always confined to $B_{\delta }(z)$ and its images. Moreover, by Sub-Lemma 4.12, if there exists j such that $\phi _{\sigma _{j+1}}\circ \cdots \circ \phi _{\sigma _{p+1}}(B_\delta (z))\cap B_\delta (z)\neq \emptyset $ , we have seen that $\sigma _{j}=\sigma _{p+1}\neq \omega _q$ , so next we apply a map, $\phi _{\sigma _j}$ , that has not been modified. On the other hand, if $x\in B_\delta (z)$ and for some k we have $\omega _k=\omega _{q}$ , we have $\phi _{\omega _{k+1}}\circ \cdots \circ \phi _{\omega _q}(x)\not \in B_\delta (z)$ . Next, we apply $\phi _{\omega _k}$ outside the region where it has been modified. It follows that, for each $x\in B_\delta (z)$ ,
Calling $z(\delta )$ the unique fixed point of $\phi _{\omega _{1}}\circ \cdots \circ \phi _{\omega _{q}}\circ h_{z,\delta }$ ,
Since $z(0)=z$ and
we have that it is not possible that $z(\delta )=z$ for all $\delta \leq c_*^{-2}\delta ^*_{p+1}/2$ . Thus, we can make a perturbation for which the two fixed points are different, and, for $\delta $ small, they cannot be equal to the other fixed points.
For any other couple of elements $\sigma \in \Sigma ^m_{p+1}(\Phi ),\omega \in \Sigma ^m_{q}(\Phi )$ , we can repeat the same process and obtain the perturbation with two different fixed points, as above. Note that, as the size of the perturbation is $\delta <c_*^{-1}\delta ^*_{p+1}/2$ , the distance between the newly obtained fixed points in $\bigcup _{q\leq p}{\mathcal A}_q$ stays positive as the perturbation does not move the fixed points more than $\delta ^*_{p+1}/2$ .
From now on, we assume that $\Phi $ satisfies Proposition 4.11.
4.4. Pre-images of the boundary manifolds and how to avoid them
Next, we need some notation and a few lemmata to describe the structure of the pre-images of the discontinuity manifolds conveniently. This allows us to develop the tools to prove Proposition 4.3.
Let f be a piecewise smooth contraction with ${\boldsymbol P}(f)=\{P_1, P_2,\ldots , P_m\}$ and ${\Phi _{\! f}=\{\phi _1,\phi _2,\ldots ,\phi _m\}}$ . Recall that, by hypothesis, $\partial {\boldsymbol P}(f)$ is contained in the finite union of ${\mathcal C}^2$ manifolds, which we will call boundary manifolds. Let $l_0$ be the number of boundary manifolds in $\partial {\boldsymbol P}(f)$ . Recall also that, for every $i\in \{1,2,\ldots ,m\}$ , $U_{i}$ is the open neighbourhood of $P_{i}\in {\boldsymbol P}(f)$ such that $\tilde f|_{U_{i}}$ is injective and hence invertible. Accordingly, by the construction of $\Phi _{\! f}$ , we have that $\phi _{i}|_{U_{i}}$ has a well-defined inverse for all $i\in \{1,2,\ldots ,m\}$ .
Let $\epsilon _0=\min \{ d_H(P_i,U_i^c)\;:\; i\in \{1,\ldots ,m\}\}$ , where the complement is taken in ${\mathbb R}^d$ . For each $\epsilon \leq \epsilon _0/2$ , we can consider the $\epsilon $ -neighbourhood $V_i$ of $P_i$ and the $\epsilon /2$ neighbourhood $V_i^-$ .
Choosing $\epsilon $ small enough, we can describe the boundary manifolds by embeddings $\psi _i\in {\mathcal C}^2(D_i^+, {\mathbb R}^d)$ , $i\in \{1,2,\ldots ,l_0\}$ , such that $\overline {\psi _i(D^+_i)}\subset U_{p}$ for some $p\in \{1,\ldots ,m\}$ , and there exists an open set $D_i\subset D^+_i\subset {\mathbb R}^{d-1} $ such that $\psi _i(D_i)\cap V_{p}\neq \emptyset $ and $\partial \psi _i(D_i)\cap \overline V_{p}=\emptyset $ (this is possible by Definition 2.16). For each IFS $\Phi $ and $\sigma \in \Sigma ^m_n$ , recalling the definition (4.2) of Y, we let
We call a sequence $\sigma $ admissible if $D_{\sigma }(\Phi )\neq \emptyset $ . We define the set $\Sigma ^m_{n,i}$ of the i-admissible sequences as
Remark 4.13. Note that, for each $N \in {\mathbb N}$ , there is a $\delta>0$ such that, for each ${i\in \{1,\ldots , l_0\}}$ , $n\leq N$ , admissible word $\sigma \in \Sigma ^m_{n,i}$ , point $x\in D^+_{\sigma }(\Phi _{\! f}\!)$ and small enough perturbations $\tilde \Phi =\{\tilde \phi _i\}$ of $\Phi $ , we have, for each $j\leq n$ , $\tilde \phi _{\sigma _j}\circ \cdots \circ \tilde \phi _{\sigma _n}(B_{\delta }(x))\subset \tilde \phi _{\sigma _{j}}(U_{\sigma _j})$ , so that the inverse function $\tilde \phi _{\sigma _n}^{-1}\circ \cdots \circ \tilde \phi _{\sigma _1}^{-1}$ is well defined on $\tilde \phi _{\sigma _1}\circ \cdots \circ \tilde \phi _{\sigma _n}(B_{\delta }(x))$ .
Remark 4.14. By the definition (2.1) of the partition ${\boldsymbol P}(f^n)$ , it follows that
where $\Sigma ^m_{0,i}(\Phi )=\{0\}$ and $\phi _0=\mathrm {id}$ , so, for $\sigma \in \Sigma ^m_{0,i}(\Phi )$ , we have $D_{\sigma ,i}(\Phi _{\! f}\!)=\psi _i(D_i)$ .
Unfortunately, the sets $D_{\sigma ,i}(\Phi _{\! f}\!)$ may have a rather complex topological structure, whereas we would like to cover $\partial {\boldsymbol P}(f^n)$ with a finite set of $(d-1)$ -dimensional manifolds described by a single chart. This is our next task.
In the following, we will write $\Sigma _{n,i}^m$ only if it does not create confusion. In addition, we set
Note that, if $\sigma \in \Sigma _{n,*}^m$ and $x\in D_{\sigma ,i}(\Phi )$ , then there exists a unique $y\in D_i$ such that $\phi _\sigma (x):=\phi _{\sigma _1}\circ \cdots \circ \phi _{\sigma _n}(x)=\psi _i(y)$ , so the following is well defined: $\psi _i^{-1}\circ \phi _{\sigma }(D_{\sigma ,i}(\Phi ))=W_{\sigma ,i}$ . Note that $W_{\sigma ,i}$ is a compact set. For all $y\in D_i$ , let $A(y)$ be the set of $\sigma \in \Sigma ^m_{p,i}$ , $p\leq n$ , such that $y\in W_{\sigma ,i}$ . As noted in Remark 4.13, there exists a $\delta (y)>0$ such that $\psi _i(B_{\delta (y)}(y))\subset \bigcap _{\sigma \in A(y)}\phi _{\sigma }(D_{\sigma ,i}(\Phi ))$ . For a fixed $N\in {\mathbb N}$ , we have that $\{B_{\delta (y)}(y)\;:\; y\in \overline {D_i}\}$ is a Besicovitch cover and, by the Besicovitch covering theorem, we can obtain a subcover in which each point can belong to at most $c_d$ balls (for some $c_d$ depending only on the dimension d). We can then extract a finite subcover
of $\overline {D_{i}}$ . We set, for $\sigma \in \Sigma _{p,i}^{m}$ with $p\in \{0,\ldots ,N\}$ , $M^N_{\sigma ,i}(\Phi )=\{\phi _{\sigma _p}^{-1}\circ \cdots \circ \phi _{\sigma _1}^{-1}\circ \psi _i(B_{\delta (y_k)}(y_k))\}$ , $M^N_{0,i}= \{\psi _i(D_i)\}$ , which is the wanted collection of $(d-1)$ -dimensional manifolds. Note that they are not necessarily disjoint. However, they have the wanted property, as the following remark states.
Remark 4.15. By the definition given by equation (2.1) of the partition ${\boldsymbol P}(f^n)$ , it follows that, for each $N\geq n$ ,
where $\Sigma ^m_{0,i}=\{0\}$ and $\phi _0=\mathrm {id}$ , so, for $\sigma \in \Sigma ^m_{0,i}$ , we have $M_{\sigma ,i}(\Phi _{\! f}\!)=\psi _i(D_i)$ .
Also, for all IFS $\Phi $ and $N\in {\mathbb N}\cup \{0\}$ , we define
In addition, for $\delta>0$ , we define the closure of the $\delta $ -neighbourhood of $D^{N}(\Phi _{\! f}\!)$ as
Remark 4.16. The basic idea of the proof is to make a perturbation such that the images of $\partial {\boldsymbol P}(f^n)$ do not self-intersect too many times. This can be done easily for a single map $\phi _i$ . However, we are dealing with compositions in which the same map can appear many times. So we have to avoid the possibility that the perturbation at one time interferes with itself at a later time. This can be achieved if there are no fixed points close to the pre-images of the singularities. This is our next task.
Proposition 4.17. Let $\Phi =\{\phi _1,\phi _2,\ldots ,\phi _m\}$ be an IFS with contraction coefficient $\unicode{x3bb} $ . For each $N\in {\mathbb N}$ and $\varepsilon>0$ small enough, there exists an IFS $\tilde \Phi =\{\tilde \phi _1,\ldots ,\tilde \phi _m\}$ and $\delta _\varepsilon \in (0,\varepsilon )$ such that $\|\phi _i-\tilde \phi _i\|_{{\mathcal C}^2}\leq \varepsilon $ , and, for any $p\leq N$ , $\sigma =(\sigma _1,\ldots ,\sigma _p)\in \Sigma ^{m}_{p,*}(\Phi )$ , we have that $\tilde \phi _{\sigma _1}\circ \cdots \circ \tilde \phi _{\sigma _p}|_{D_\sigma (\tilde \Phi )}$ is invertible. Moreover, $x\in D_{\delta _\varepsilon }^{N}(\tilde \Phi )$ implies that $\tilde \phi _{\sigma _1}\circ \cdots \circ \tilde \phi _{\sigma _p}(x)\neq x$ . Finally, there exists $c_*>2$ such that, for any $\delta \in (0,c_*^{-1}\delta _\varepsilon /2)$ and $y\in D_{\delta _{\varepsilon }/2}^{N}(\tilde \Phi )$ ,
Proof. The last statement of the proposition is an immediate consequence of the first part and Lemma 4.10. As for the first part, note that if, for each $\sigma \in \Sigma ^m_{p, *}$ , $p\leq N$ , the fixed points of $\phi _\sigma :=\phi _{\sigma _1}\circ \cdots \circ \phi _{\sigma _p}$ do not belong to $D^{N}(\tilde \Phi )$ , then the proposition holds with $\delta _\varepsilon $ small enough. Thus, it suffices to prove the latter fact.
By Remark 4.13, it follows that there exists $\varepsilon _0>0$ such that, for each $\varepsilon \in (0,\varepsilon _0)$ and $\varepsilon $ -perturbation $\tilde \Phi $ of $\Phi $ , for all $\sigma \in \Sigma ^m_{p,*}(\Phi )$ , $p\leq N$ , the inverse map of $\phi _\sigma $ is well defined in $D_\sigma (\tilde \Phi )$ . In addition, by Lemma C.3, $\Sigma ^m_{p,*}(\Phi )=\sigma ^m_{p,*}(\tilde \Phi )$ . From now on we assume that $\varepsilon \leq \varepsilon _0$ .
We can then apply Proposition 4.11 to obtain the IFS $\Phi ^0$ , a $\varepsilon /4$ perturbation of $\phi $ , where the fixed points differ unless they are associated with sequences composed by the repetition of the same word. Next, we want to proceed by induction on the sequences in $\Sigma ^m_{N,*}(\Phi )$ .
To this end, it is necessary to have an order structure on $\Sigma ^m_{N,*}(\Phi )$ . We introduce the following order: $0\prec \sigma $ for all $\sigma \neq 0$ , and if $p>q$ and $\sigma \in \Sigma ^m_{p,*}(\Phi )$ , $\sigma '\in \Sigma ^m_{q,*}(\Phi )$ , then $\sigma '\preceq \sigma $ . If $p=q$ , then the $\sigma $ are ordered lexicographically. This is a total ordering, and hence we can arrange them as sequences $\{\sigma ^i\}_{i\in {\mathbb N}}$ with $\sigma ^j\prec \sigma ^i$ if and only if $i>j$ . Next, define $\ell (j)$ to be the length of the word $\sigma ^j$ : that is $\sigma ^j\in \Sigma ^{m}_{\ell (j),*}$ . Recall the definition of fixed points (4.8). It is convenient to set $\sigma ^0=0$ and $x_0(\phi )=\emptyset $ . Also, let $\Lambda _0=\max \{\|(D_x\phi _i)^{-1}\|_{{\mathcal C}^0(V_i)}\;:\; \phi _i\in \Phi ^0\}$ .
The idea is to define a sequence of perturbations $\Phi ^k$ , $\|\Phi ^{k+1}-\Phi ^{k}\|_{{\mathcal C}^2}=\varepsilon _k\leq \varepsilon 2^{-k-1}$ , such that, for all $j\leq k$ ,
Note that the above implies that there exists $\Lambda>2$ such that
for all $k\in {\mathbb N}$ . In particular, the above implies that
Using the notation introduced just before Proposition 4.11, let
We proceed by induction on $\sigma $ . For $\sigma ^0$ , the statement in the induction is trivially true. We assume that it is true for $\sigma ^k$ and we prove the statement for $\sigma ^{k+1}$ . Let
We consider $\varepsilon _k$ -perturbations of $\Phi ^k$ with $\varepsilon _k\leq \Lambda ^{-N}\delta _k/4$ . For $\sigma \in \Sigma ^m_p$ , we use the notation $\phi _{\sigma }=\phi _{\sigma _1}\circ \cdots \circ \phi _{\sigma _p}$ . If $x_{\sigma ^{k+1}}(\Phi ^k)\not \in D^{N}(\Phi ^k)$ , then we set $\Phi ^{k+1}=\Phi ^k$ and the induction step is satisfied. Otherwise, as before, for some $a>2$ , and any $\delta \in (0,1/ \sqrt a), v\in V$ and $\bar x\in {\mathbb R}^d$ , we define $h_{\bar x,\delta ,v}:{\mathbb R}^d\to {\mathbb R}^d$ as
where $g\in {\mathcal C}^\infty ({\mathbb R},{\mathbb R}_+)$ is a monotone function such that $g(y)=0$ for all $y\leq 0$ , $g(y)=1$ for all $y\geq 1/2$ and $\|g'\|_\infty < a$ . Let $p=\ell (\sigma ^{k+1})$ . For each $\delta>0$ and $v\in {\mathbb R}^d$ , $\|v\|\leq 1$ , we consider the perturbations $\Phi _{\delta , v}=\{\phi _{i,\delta , v}\}$ defined by
where $\Phi ^k=\{\phi _i\}$ . Note that $\|\phi _i-\phi _i\circ h_{\bar x,\delta ,v}\|_{{\mathcal C}^2}\leq C_g\delta $ and $\|\phi _i\circ h_{\bar x,\delta ,v}\|_{{\mathcal C}^3}\leq C_g$ for some constant $C_g>1$ . Thus, these are $\varepsilon _k$ perturbations provided $\delta \leq C_g^{-1}\varepsilon _k\leq C_g^{-1}\Lambda ^{-N}\delta _k/4$ .
Lemma 4.18. There exist $ C_*>0$ and $\delta _*\in (0, \min \{C_g^{-1}\varepsilon _k, \Lambda ^{-2N}\})$ such that, for all $\delta \leq \delta _*$ , $\|v\|\leq 1$ and each $j\leq k+1$ ,
Moreover, $\partial _vx_{\sigma ^j}(\Phi _{\delta , v})$ is invertible and
Proof. Let $q=\ell (\sigma ^j)$ . If $\sigma ^j_s\neq \sigma ^{k+1}_p$ , for all $s\leq q$ , then $x_{\sigma ^{j}}(\Phi ^k)=x_{\sigma ^j}(\Phi _{\delta , v})$ . Otherwise, let $\bar s$ be the largest such that, for all $s\leq \bar s$ , $\sigma ^j_s\neq \sigma ^{k+1}_p$ ,
Note that, $(\sigma ^j_{\bar s},\ldots ,\sigma ^j_{q},\sigma ^j_1,\ldots , \sigma ^j_{\bar s-1},)=\sigma ^{j_1}$ for some $j_1<k+1$ . Moreover, ${\sigma ^{j}_q=\sigma ^{k+1}_p}$ and
By hypothesis,
We can thus consider only the case in which $\sigma ^{j}_q=\sigma ^{k+1}_p$ . Let $1\leq \bar s<q$ be the largest integer, if it exists, such that $\sigma ^j_s= \sigma ^{k+1}_p$ for all $s\leq \bar s$ . Then for $y\in B_\delta (x_{\sigma ^{k+1}}(\Phi ^k))$ ,
Since, by construction, $h_{x_{\sigma ^{k+1}}} (B_\delta (x_{\sigma ^{k+1}}(\Phi ^k)))\subset B_\delta (x_{\sigma ^{k+1}}(\Phi ^k))$ , we have that applying $\phi _{\sigma ^j_{\bar s},\delta ,v}$ differs from applying $\phi _{\sigma ^j_{\bar s}}$ only if
But then Lemma 4.10 implies that
Then our choice of $\delta $ and the induction hypothesis implies that $x_{\sigma ^{k+1}}(\Phi ^k)=x_{(\sigma ^j_{s+1}, \ldots , \sigma ^j_q)}(\Phi ^k)\not \in D^{N}(\Phi ^k)$ , which is contrary to our current assumption. It follows that, provided $x_{\sigma ^{j}}(\Phi _{\delta ,v})\in B_\delta (x_{\sigma ^{k+1}}(\Phi ^k))$ ,
To simplify notation, let $z(\delta ,v)=x_{\sigma ^{j}}(\Phi _{\delta ,v})$ , $h_{\delta ,v}=h_{x_{\sigma ^{k+1}}(\Phi ^k),\delta ,v}$ and $\phi _{\delta ,v}=\phi _{\sigma ^j}\circ h_{\delta ,v}$ . We can study $z(\delta ,v)$ by applying the implicit function theory, which yields
If $z(\delta ,v)\in B_{\delta /2}(x_{\sigma ^{k+1}}(\Phi ^k))$ , then $h_{\delta ,v}(z(\delta ,v))=z(\delta ,v)+\delta ^3 v$ , and $\partial _\delta h_{\delta ,v} (z(\delta ,v))=3\delta ^2v$ . Thus, setting $A:=D_{z(\delta ,v)+\delta ^3v}\phi _{\delta ,v}$ we obtain
Since the maximal eigenvalue of is bounded by $(1-\unicode{x3bb} )^{-1}\unicode{x3bb} $ , there exists a $\delta _*$ such that, for all $\delta \leq \delta _*$ , $z(\delta ,v)\in B_{\delta /2}(x_{\sigma ^{k+1}}(\Phi ^k))$ . Moreover,
and thus $z(\delta ,v)\not \in B_{(3/2)\delta ^2\|v\|\Lambda ^{-N}}(x_{\sigma ^{k+1}}(\Phi ^k))$ . Finally, for each $\delta \leq \delta _0$ ,
from which the last statement of the lemma follows.
By Lemma 4.18, equation (4.17) and our choice of $\varepsilon _{k}$ , $x_{\sigma ^j}(\Phi _{\delta ,v})\not \in D^{N}_{3\delta _k/4}(\Phi ^{k})$ and $D^{N}(\Phi ^{k+1})\subset D^{N}_{\delta _k/4}(\Phi ^{k})$ for all $j\leq k$ . Thus, $x_{\sigma ^j}(\Phi _{\delta ,v})\not \in D^{N}(\Phi ^{k+1})$ for all $j\leq k$ . We are left with $x_{\sigma ^{k+1}}(\Phi ^{k+1})$ , recalling that $x_{\sigma ^{k+1}}(\Phi ^k) \in D^{N}(\Phi ^k)$ . Let $\omega \in \Sigma ^m_{q,i}$ , $q\leq N$ and $M \in M_{\omega ,i}^{N}(\Phi ^k)$ such that $x_{\sigma ^{k+1}}(\Phi ^k)\in M$ . Then
First, suppose that $\omega _q\neq \sigma ^{k+1}_p$ . It follows that, for $y\in B_{\delta /2}(x_{\sigma ^{k+1}}(\Phi ^k))$ ,
only if, for some $s< q$ , $\omega _s=\sigma ^{k+1}_p$ and
but this is ruled out by our choice of $\delta _k$ and Proposition 4.11. The above discussion shows that $M\cap B_{\delta /2}(x_{\sigma ^{k+1}}(\Phi ^k))$ is an element of $M_{\omega , i}^{N}(\Phi _{\delta ,v})$ as well. Hence, it suffices to ensure that $x_{\sigma ^{k+1}}(\Phi _{\delta ,v})\not \in M$ . Since Lemma 4.18 shows that varying v the fixed point visits an open ball, and since M has zero measure, it follows that there exists an open set of v which yields the wanted property.
It remains to analyse the case $\omega _q= \sigma ^{k+1}_p$ . In this case, for $y\in B_{\delta /4}(x_{\sigma ^{k+1}}(\Phi ^k))$ ,
only if equation (4.19) is satisfied, which, by our choice of $\delta _k$ , is possible only if $x_\omega (\Phi ^k)=x_\sigma (\Phi ^k)$ . But then Proposition 4.11 implies that there exist $\sigma ^r\preceq \sigma ^{k+1}$ such that $\phi _\omega =\phi _{\sigma ^r}^{m_1}$ and $\phi _\sigma =\phi _{\sigma ^r}^{m_2}$ , which would mean that
Again, Lemma 4.18 allows to find an open set of v for which $x_{\sigma ^{k+1}}(\Phi ^k)\not \in \overline {\psi _i(D_i)}$ . The last possibility is that
This implies that the perturbed manifold M is displaced by, at most, $2\Lambda ^{N}\delta ^3\|v\|$ whereas Lemma 4.18 implies that the fixed point moves by at least $(3/2)\delta ^2\|v\|\Lambda ^{-N}\geq 2\Lambda ^{N}\delta ^3\|v\|$ . Hence, again we have an open set of v, which produces perturbations with the wanted property. As a last observation, note that if there are other manifolds $M\in D^{N}(\Phi ^k)$ such that $x_{\sigma ^{k+1}}(\Phi ^k)\in M$ , then we can repeat the same argument and we have just a smaller open set of v that does the job. This concludes the overall induction and hence the proof of Proposition 4.17.
4.5. Perturbations with low complexity and the proof of Proposition 4.3
Thanks to Proposition 4.17, we can finally construct the wanted perturbation $\tilde {f}$ .
Let f be a piecewise smooth contraction with the maximal partition ${\boldsymbol P}(f)=\{P_1,P_2,\ldots ,P_m\}$ . Let $l_{0}\in {\mathbb N}$ be the number of manifolds in $\partial {\boldsymbol P}(f)$ . Define ${l_1=\max \{c_d l_{0},d\}}$ . (Recall that, by construction, $c_d$ is the maximal number of manifolds in $M^N_{\sigma ,i}$ , $N\in {\mathbb N}$ and $\sigma \in \Sigma ^m_{0,i}$ , that can contain a point in $\psi _i(D_i^+)$ .)
Given two manifolds defined by maps $\psi _1,\psi _2$ , we write $\psi _1{\pitchfork }\psi _2$ if the manifolds are transversal (see Definition B.1 for the definition of transversality). On the contrary, if the two manifolds have an open (in the relative topology) intersection, we call them compatible and write $\psi _1\curlywedge \psi _2$ . If two manifolds are not compatible, then we write $\psi_1\not\hskip-3.5pt\curlywedge\hskip2pt \psi_2$ .
Proposition 4.19. Let $f:X\to X$ be a piecewise smooth contraction with maximal partition ${\boldsymbol P}(f)$ . Then, for any $N\in {\mathbb N}$ and $\varepsilon>0$ small enough, there exists a piecewise smooth contraction $\tilde {f}$ , with $d_2(f,\tilde {f})<\varepsilon $ such that no more than $2^{dm^{d-1} l_1}$ partition elements of ${\boldsymbol P}(\tilde {f}^N)$ can meet at one point.
Proof. Before starting the proof, we need to introduce some language.
Consider an IFS $\Phi _{\! f}=\{\phi _1,\phi _2,\ldots ,\phi _m\}$ associated to f with contraction coefficient $\unicode{x3bb} $ . Let $\varepsilon _0\in (0,\varepsilon /4)$ be small enough. Then, by Proposition 4.17, there exist $\delta _*\leq \delta _\varepsilon \in (0,\varepsilon _1)$ and an $\varepsilon _0$ -perturbation (here, and in the following, by perturbation we mean a function that is ${\mathcal C}^2$ close and with a uniformly bounded ${\mathcal C}^3$ norm) of $ \Phi _{\! f}$ (which, abusing the notation, we still call $\Phi _{\! f}$ ) such that, for every $p\leq N$ , $\sigma =(\sigma _1,\ldots ,\sigma _p)\in \Sigma ^{m}_{p,*}$ and $\xi \in D^{N}_{\delta _*}(\Phi _{\! f}\!)\cap V_p$ ,
Note that there exists $\epsilon _0\leq \varepsilon _0$ such that (4.20) persists for $\epsilon _0$ -perturbations of $\Phi $ .
By compactness, for $\delta \in (0,\min \{\delta _{*}/2,\delta _N\})$ , where $\delta _N>0$ is such that it satisfies the condition of Remark 4.13 for each $\epsilon _0$ -perturbation, there exists a finite open cover $\{B_{\delta /2}(z_i)\}_{i=1}^t$ of $\overline {D_{\delta }^N(\Phi _{\! f}\!)}$ (which, by definition, contains $\partial {\boldsymbol P}(f^N) $ ) such that, for each i, $z_i\in P_j$ and $B_{\delta }(z_i)\subset V_j$ , for some j. (See the discussion at the beginning of §4.4 for the definition of $V_j$ .)
Let $Y=\overline Y\subset {\mathbb R}^d$ be compact, such that $\overline {\phi (Y)}\subset Y$ for all $\phi \in \Phi $ .
By convention, we set ${\mathcal Z}^1_0(\Phi )=\{Y\}$ and ${\mathcal Z}^1_1(\Phi ):=\{\psi _{\omega }\;;\;\omega \in \{1,\ldots ,l_{*}\}\}$ to be the collection of the manifolds $\psi _i(A)$ for $A\in {\mathcal W}^N_i$ , as defined in (4.13). Also, we call ${\mathcal Z}^1_k(\Phi ):=\{\psi ^k_{\omega }\;:\;\omega \in \{1,\ldots ,l_{*}\}^k\}$ the manifolds consisting of the intersection of the k manifolds $\{\psi _{\omega _i}\;:\;i\in \{1,\ldots ,k\}; i\neq j\!\implies \!\psi _{\omega _i}\not\hskip-3pt\curlywedge\hskip2pt\psi_{\omega_j}\} $ . (These are indeed manifolds; see Definition 2.16. To simplify notation, we use $\psi _\omega $ both for the manifold and for the map that defines it.) By construction, ${\mathcal Z}^1_k(\Phi )=\emptyset $ for $k>l_0$ . In addition, the maximal dimension of the manifolds in ${\mathcal Z}^1_k(\Phi )$ , for $k>1$ , is $d-2$ (since the boundary manifolds are pairwise transversal; see Definition 2.16). Note that ${\mathcal Z}^1_1(\Phi _{\! f}\!)$ is a collection that covers the boundary manifolds; for simplicity, we call the elements of ${\mathcal Z}^1_1(\Phi _{\! f}\!)$ , from now on, original boundary manifolds. For each $s\in {\mathbb N}$ and $k_1,\ldots , k_n\in {\mathbb N}\cup \{0\}$ , let ${{\mathcal Z}^s_0=\{Y\}, U_0=Y}$ and (to alleviate notation, from now on we write $\phi _{\sigma _i}^{-1}$ to mean the inverse of $\phi _{\sigma _i}|_{U_{\sigma _i}}\!$ , while the domain of $\phi _{\sigma _i}^{-1}\circ \psi $ consists of the points where the composition is well defined)
Note that ${\mathcal Z}^s_*$ contains the admissible pre-images of the boundary manifolds under composition of at most s maps in $\Phi $ and all their intersections. In particular, the sets in ${\mathcal Z}^s_1$ cover $\partial {\mathcal P}(f^s)$ , and we call them boundary manifolds. In addition, if a set belongs to ${\mathcal Z}^s_k(\Phi )$ , then, by definition, it is determined by the intersection of the pre-images of k original boundary manifolds. Also, we remark that ${\mathcal Z}^{s+1}_k\supset {\mathcal Z}^s_k$ since $\phi _0=\mathrm {id}$ (see Remark 4.14 and Definition (4.12)). Next, let $N_s$ be the maximal number of manifolds that can intersect in ${\mathcal Z}^s_*$ : that is, ${\mathcal Z}^s_k=\emptyset $ for $k>N_s$ . We have seen that $N_1\leq l_1$ . Moreover, each original boundary manifold can have at most $m^s$ different pre-images obtained by the compositions of s maps. This implies that, at each point, we have at our disposal at most $\sum _{s'=0}^{s-1} m^{s'}l_1$ different manifolds from $\bigcup _i M^N_{0,i}$ to intersect. If $m=1$ , then $N_s\leq sl_1$ ; if $m\geq 2$ , then $N_s\leq ({m^{s}-1})/({m-1}) l_1< 2m^{s-1}l_1$ . Accordingly, $N_s\leq s m^{s-1} l_1$ . (We remark that, by definition, the pre-images are taken via invertible maps, and hence the manifolds cannot self-intersect.)
Our goal is to produce a sequence of perturbations $\Phi ^s$ of $\Phi _{\! f}=:\Phi ^0$ such that $\Phi ^{s}$ is a $2^{-s}\varepsilon $ perturbation of $\Phi ^{s-1}$ with the following property.
-
(⋆) The set ${\mathcal Z}^{s}_k(\Phi ^s)$ consists of manifolds of dimension strictly smaller than $d-j$ for all $k> jm^{j-1}l_1$ , whereas ${\mathcal Z}^{s}_1(\Phi ^s)$ consists of $d-1$ dimensional manifolds. This property persists for small perturbations of $\Phi ^s$ .
Note that the above implies that ${\mathcal Z}^{s}_k(\Phi ^{s})=\emptyset $ for each $s\in {\mathbb N}$ and $k> dm^{d-1}l_1$ . Accordingly, at most $dm^{d-1} l_1$ pre-images of the original boundary manifolds under composition of at most s elements of $\Phi ^s$ can have non-empty intersections. In turn, defining $f_s(x)=\phi _i(x)$ for $x\in P_i$ and $\phi _i\in \Phi ^s$ , we obtain a perturbation of f smaller than $\sum _{j=1}^{s}2^{-j}\varepsilon \leq \varepsilon $ such that ${\mathcal P}(f_s^s)$ has at most $2^{dm^{d-1} l_1}$ elements meeting at point.
Indeed, suppose p elements of ${\mathcal P}(f_s^s)$ meet at a point x. The boundaries of such elements in a neighbourhood small enough of x consist of codimension one manifolds belonging to ${\mathcal Z}^s_1(\Phi ^s)$ , and they have to intersect at x. Suppose that the total number of such boundary manifolds is q. Then x must belong to a manifold in ${\mathcal Z}^s_q(\Phi ^s)$ , and hence it must be ${q\leq dm^{d-1} l_1}$ . Note that we can uniquely define a partition element by specifying on which side it lies with respect to all its boundary manifolds. Since there are at most $2^q$ possibilities, it must be $ p\leq 2^q$ . It follows that $p\leq 2^{dm^{d-1} l_1}$ . The lemma then follows by choosing $s=N$ .
It remains to prove property $(\star )$ . We proceed by induction. If $s=1$ and $k\in \{1,\ldots ,l_1\}$ , then the manifolds in ${\mathcal Z}^1_k$ are indeed of codimension at least one, and the manifolds in ${\mathcal Z}^1_1$ are of codimension one, whereas, if $k>l_1$ , then ${\mathcal Z}^1_k=\emptyset $ , so $\Phi _{\! f}=\Phi ^0$ satisfies our hypothesis. We assume that the hypothesis is verified for some s and prove it for $s+1$ .
Let $\epsilon _s\leq \min \{2^{-s-1}\varepsilon ,\epsilon _0\}$ be such that all the $\epsilon _s$ -perturbations of $\Phi ^s$ still satisfy $(\star )$ . This implies that, provided $\Phi ^{s+1}$ is a $\epsilon _s$ perturbation of $\Phi ^s$ , ${\mathcal Z}_*^{s'}(\Phi ^{s+1})$ has the wanted property for all $s'\leq s$ .
Accordingly, we must analyse only sets of the type $\phi _{\sigma _1}^{-1}\circ \psi _{1}\cap \cdots \cap \phi _{\sigma _n}^{-1}\circ \psi _n$ , where $\psi _i\in {\mathcal Z}_{k_i}^s(\Phi ^s)$ and $\phi _{\sigma _i}\in \Phi ^{s}\cup \{\mathrm {id}\}$ , $\sigma _i\in \{0,\ldots ,m\}$ . (By an innocuous abuse of notation here we use $\psi _i$ to refer to generic elements.) By definition, such sets are elements of ${\mathcal Z}^{s+1}_k(\Phi ^s)$ , with $k=\sum _{i=1}^n k_i$ . Note that the $\psi _i\in {\mathcal Z}^s_0(\Phi ^s)$ do not contribute to the intersection. We can thus assume, without loss of generality, that $k_i>0$ . Note that if $n=1$ , then the manifolds belong to $\bigcup _{i=0}^m\phi _i^{-1}({\mathcal Z}_*^s(\Phi ^s))\subset {\mathcal Z}_*^{s+1}(\Phi ^{s})$ which have automatically the wanted property, and so has any $\epsilon _s$ -perturbation. We consider thus only the case $n\geq 2$ . In addition, if $\phi _{\sigma _i}=\phi _{\sigma _j}$ , $i\neq j$ , then $\phi _{\sigma _i}^{-1}\circ \psi _i\cap \phi _{\sigma _j}^{-1}\circ \psi _j=\phi _{\sigma _i}^{-1}\circ (\psi _j\cap \psi _j)$ , and since, by definition, $\psi _i\not\hskip-3pt\curlywedge\hskip2pt\psi_j $ , $\psi _j\cap \psi _j\in {\mathcal Z}^s_{k_i+k_j}(\Phi ^s)$ . Hence, we can substitute to the intersection of the manifolds $\phi _{\sigma _i}^{-1}\circ \psi _i\cap \phi _{\sigma _j}^{-1}\circ \psi _j$ the manifold $\phi _{\sigma _i}^{-1}\circ (\psi _j\cap \psi _j)$ . We can thus assume, without loss of generality, that $i\neq j$ implies that $\phi _{\sigma _i}\neq \phi _{\sigma _j}$ .
We define the map $F:{\mathbb R}^d\to {\mathbb R}^{nd}$ by $F(x):=(\phi _{\sigma _1}(x),\ldots , \phi _{\sigma _n}(x))$ and the stratified sub-variety $C=\{(\psi _{1}(x_0), \ldots \psi _{n}(x_n))\;:\; x_i\in \overline D_{i}\}$ , where $D_i\subset {\mathbb R}^{d_i}$ is the domain of the map $\psi _i$ . By Lemma B.4, for a constant c to be chosen later, there exists a $\tfrac 12 c\epsilon _s$ -perturbation $\hat F=(\hat \phi _{\sigma _1}(x),\ldots , \hat \phi _{\sigma _n}(x))$ of F, transversal to C. If $\phi _{\sigma _i}\neq \mathrm {id}$ for all i, then we set $\tilde F=\hat F$ . If, for some i, $\phi _{\sigma _i}=\mathrm {id}$ , then $\hat \phi _{\sigma _i}$ is a small perturbation of identity, and hence it is invertible with ${\mathcal C}^3$ inverse. (Indeed, if $\|h-\mathrm {id}\|_{{\mathcal C}^1}=\alpha <1$ , then h is a diffeomorphism. In fact, if $h(x)=h(y)$ ,
which implies that $\|x-y\|\leq \alpha \|x-y\|$ : that is $x=y$ . Thus, h is globally invertible, and the claim follows by the inverse function theorem.) By possibly relabelling, we can assume that $i=1$ . Then we define
$\tilde F$ is still transversal to C and, if c is small enough, by Lemmata C.1 and C.2, it is a $c\epsilon _s$ -perturbation of F. Let $d_i=d-j_i$ be the dimension of the manifold $\psi _{i}$ . Then ${k_i\leq j_im^{j_i-1}l_1}$ . Lemma B.4 implies that the sets $\bigcap _{i=1}^n\phi _{\sigma _i}^{-1}\circ \psi _i$ are manifolds with dimension (actually, they are stratified sub-varieties, but we can restrict them to manifolds without loss of generality)
Note that
Accordingly, if $k>j m^{j-1}l_1$ , then $\bar j\geq j+1$ and the manifold has a dimension strictly smaller than $d-j$ , as required. (Of course, if $k>dm^{d-1}l_1$ , then the intersection is empty.)
We would then like to define a perturbed IFS $\tilde \Phi ^{s}$ as
Unfortunately, this may perturb the new manifolds $\psi _i$ as well, since they are now defined via pre-images of $\tilde {\Phi }^{s}$ . This is the last problem we need to take care of. To this end, we make the perturbation only locally starting from the ball $B_{\delta }(z_1)$ . Once we check that the perturbation is as required in this ball, we will consider the other balls, making the new perturbations small enough not to upset the property obtained in $B_{\delta /2}(z_1)$ .
Let $g\in {\mathcal C}^\infty ({\mathbb R}^d,[0,1])$ be such that
and $\|g\|_{{\mathcal C}^r}\leq C\delta ^{-r}$ for $r\in \{0,1,2,3\}$ for some $C>0$ . Define, for each $i\in \{1,\ldots ,m\}$ ,
Provided we choose c small enough,
Then we define the perturbation $\Phi ^{s,1}=\{\phi _{i,1}\}$ . Note that the $\Phi ^{s,1}$ equals $\Phi ^{s}$ outside the ball $B_\delta (z_1)$ and agrees with $\tilde \Phi ^s$ inside the ball $B_{\delta /2}(z_1)$ .
Recall that we perturbed the system in order to control the intersection of the manifolds $\phi _{\sigma _1}^{-1}\circ \psi _1\cap \cdots \cap \phi _{\sigma _n}^{-1}\circ \psi _n$ . By construction, each $\psi _i$ is the intersection of manifolds $\phi _{\sigma ^{i,j}_{s}}^{-1}\circ \cdots \phi _{\sigma ^{i,j}_1}^{-1}\circ \bar \psi _{i,j}$ with $\bar \psi _{i,j}\in {\mathcal Z}_1^1(\Phi ^s)$ and $\sigma ^{i,j}\in \{0,\ldots ,m\}^s$ . We are thus interested in $A:=\phi _{\sigma _{i},1}^{-1}\circ \phi _{\sigma ^{i,j}_{s},1}^{-1}\circ \cdots \circ \phi _{\sigma ^{i,j}_1,1}^{-1}\circ \bar \psi _{i,j}\cap B_{\delta }(z_1) $ , which are the perturbation of ${\phi _{\sigma _i}^{-1}\circ \psi _i}$ . Let
By choosing $\epsilon _s$ small enough, we have $\|h_i- \mathrm {id}\|_{{\mathcal C}^2}\leq \delta /4$ ; in particular, $h_i$ is a diffeomorphism. It follows that $h_i(B_\delta (z_1))\subset B_\delta (z_1)$ . Let $x\in A$ . Then Proposition 4.17 implies that
and hence the part of $\psi _i$ contained in $\phi _{\sigma ^{i},1}(B_\delta (z_1))$ is unchanged. This implies that, inside the ball $B_{\delta /2}(z_1)$ , the IFS $\Phi ^{s,1}$ has the wanted property for the manifold $\phi _{\sigma _{1},1}^{-1}\circ \psi _1\cap \cdots \cap \phi _{\sigma _n,1}^{-1}\circ \psi _n$ . Moreover, by the openness of the transversality property, there exists $\epsilon _{s,1}\leq \epsilon _s/4$ such that the wanted property persists in $B_{\delta /2}(z_1)$ for each $\epsilon _{s,1}$ perturbation. We can now consider all the other pre-images and do the same procedure with ${\epsilon _{s,j}\leq 4^{-j-1}\epsilon _s}$ for the jth intersection manifold. In this way, we can construct an IFS $\Phi ^{s,q}=:\tilde \Phi ^{s,1}$ for some $q\in {\mathbb N}$ that is a $\epsilon _s/3$ perturbation of $\Phi ^s$ and has the wanted property in $B_{\delta /2}(z_1)$ . We can then repeat the same procedure in the ball $B_\delta (z_2)$ to obtain an IFS $\tilde \Phi ^{s,2}$ that is a $ \epsilon _s/9$ perturbation of $\tilde \Phi ^{s,1}$ , small enough not to upset what we have achieved in $B_{\delta /2}(z_1)$ . Iterating such a construction, we finally obtain $\Phi ^{s+1}=\tilde \Phi ^{s,t}$ , which has the wanted property on all the space since $\{B_{\delta /2}(z_i)\}_{i=1}^t$ is a covering of $ D^N(\Phi ^s_{\! f}\!)$ and is a $\sum _{i=1}^{t}\epsilon _s 3^{-k}\leq \epsilon _s\leq 2^{-s-1}\varepsilon $ perturbation of $\Phi ^s$ . This concludes the induction argument.
Finally, we can prove Proposition 4.3
Proof of Proposition 4.3
Let f be a piecewise smooth contraction with contraction coefficient $\unicode{x3bb} <1$ and maximal partition ${\boldsymbol P}(f)=\{P_1, P_2, \ldots , P_m\}$ . If $\unicode{x3bb} <1/2m$ , then the proof is complete; otherwise, we have the following.
Let the IFS associated to f be $\Phi _{\! f}=\{\phi _1,\phi _2,\ldots ,\phi _m\}$ and let $l_1=\max \{c_d l_0,d\}$ , where $l_0$ is the number of boundary manifolds in $\partial {\boldsymbol P}(f)$ , d is the dimension of the space and $c_d$ is the maximum number of original boundary manifold overlaps (as defined in (4.13)). Let $N\in {\mathbb N}$ be the least number such that $\unicode{x3bb} ^N 2^{ dm^{d-1} l_1} <1/4$ . By Lemma 4.19, for $\varepsilon>0$ small enough, there exists a piecewise contraction $\tilde {f}$ such that $d_2(f,\tilde {f})<\varepsilon $ and no more than $2^{dm^{d-1} l_1}$ elements of the partition $\partial {\boldsymbol P}(\tilde f^N)$ have a non-empty intersection of their closure.
Accordingly, for each $x\in X$ , there is a $\delta (x)$ such that $B_{\delta (x)}(x)$ intersects at most $2^{dm^{d-1} l_1}$ elements of ${\boldsymbol P}(\tilde f^N)$ . Since X is compact, we can extract a finite cover $\{B_{\delta (x_j)/2}(x_j)\}$ . Set $\delta =\tfrac 12\min \{\delta (x_j)\}$ and let $k\in {\mathbb N}$ be such that, for any partition element $P\in {\boldsymbol P}(\tilde {f}^{kN})$ , $\operatorname {diam}(\tilde {f}^{kN}(P))<\delta /2$ ; hence $\tilde {f}^{kN}(P)\subset B_{\delta (x_j)}(x_j)$ for some j. Therefore, it can intersect at most $2^{dm^{d-1} l_1}$ elements of ${\boldsymbol P}(\tilde f^N)$ .
To conclude, let L be the number of elements of ${\boldsymbol P}(\tilde {f}^{kN})$ . Then $\#{\boldsymbol P}(\tilde {f}^{2kN})\leq L2^{dm^{d-1} l_1}$ and $\#{\boldsymbol P}(\tilde {f}^{jkN})\leq L(2^{dm^{d-1} l_1})^{j}$ for $j\in {\mathbb N}$ . Since $\unicode{x3bb} ^{kN}(2^{dm^{d-1} l_1})<1/4$ , there exists $j_*\in {\mathbb N}$ such that $L(2^{dm^{d-1} l_1})^{j_*}\unicode{x3bb} ^{j_*kN}< 1/2$ . Hence $\tilde {f}^{j_*kN}$ is strongly contracting.
Acknowledgements
We thank Henk Bruin, Oliver Butterley, Giovanni Canestrari, Roberto Castorrini, Stefano Galatolo, Arnaldo Nogueira, Benito Pires and Antonio Rapagnetta for helpful discussions, comments and suggestions. We would also like to thank the anonymous referee for pointing out the inconsistencies and typographical errors in the previous version. This work was supported by the PRIN Grants ‘Regular and stochastic behaviour in dynamical systems’ (PRIN 2017S35EHN), ‘Stochastic properties of dynamical systems’ (PRIN 2022NTKXCX) and by the MIUR Excellence Department Project MathTOV awarded to the Department of Mathematics, University of Rome Tor Vergata. S.J. was additionally funded by ARC (DP220100492) and acknowledges Monash University and C.L. acknowledges membership to the GNFM/INDAM.
A Appendix. Extension theorem
Here we discuss the extension theorems needed in the paper. Recall the classical extension theorem for Lipschitz functions.
Theorem A.1. ((Kirszbraun–Valentine theorem) [Reference Valentine26])
Let $f:S(\subset {\mathbb R}^d)\to {\mathbb R}^d$ be a Lipschitz continuous function. Then f can be extended to any set $T \subset {\mathbb R}^d$ to a Lipschitz continuous function with the same Lipschitz constant.
The above can be easily extended to ${\mathcal C}^r$ functions.
Theorem A.2. ( ${\mathcal C}^r$ version of Kirszbraun–Valentine theorem)
Let $S\subset {\mathbb R}^d$ be a compact set and let $f:S(\subset {\mathbb R}^d)\to {\mathbb R}^d$ be a ${\mathcal C}^r$ function, for $r\in {\mathbb N}$ , such that $\mathrm{Lip}(f)=\unicode{x3bb} <1$ and $f^{-1}|_S$ is ${\mathcal C}^1$ . Then f can be extended to $ {\mathbb R}^d$ to a ${\mathcal C}^r$ function $f_*$ such that $\mathrm{Lip}(f_*)=\mathrm{Lip}(f)$ and $f_*^{-1}|_S=f^{-1}|_S$ .
Proof. Note that $\|f|_S\|_{{\mathcal C}^r}$ being finite and S being compact implies that there exists an open neighbourhood U of S such that f is ${\mathcal C}^r$ in U. By the inverse function theorem, f is invertible in U with $\|f^{-1}|_{U}\|_{{\mathcal C}^1}<\infty $ . By the Kirszbraun–Valentine theorem A.1, there exists $\tilde {f}: {\mathbb R}^d\to {\mathbb R}^d$ such that $\tilde {f}|_S=f$ and $\mathrm{Lip}(\tilde {f})=\mathrm{Lip}(f)=\unicode{x3bb} $ . Then ${\tilde {f}|_{U}\circ f^{-1}|_{U}=\mathrm {id}}$ , so we can define ${\tilde {f}}^{-1}|_{U}=f^{-1}$ . Now let $\phi :{\mathbb R}^d\to {\mathbb R}^d$ be a ${\mathcal C}^\infty $ function compactly supported on $\overline {U}$ and $\int \phi =1$ and define the convolution
For $\delta>0$ , let V be a $\delta $ -neighbourhood of U. Define a ${\mathcal C}^\infty $ function $g:{\mathbb R}^d\to {\mathbb R}$ as
such that $\|g\|_{{\mathcal C}^r}<c_r$ . Finally, define $f_*:{\mathbb R}^d\to {\mathbb R}^d$ as $f_*(x)=g(x)f(x)+(1-g(x))\tilde {f}\star \phi (x)$ . Let $x\in U$ . Then
Therefore, $f_*$ is an extension of f. To check whether it is Lipschitz, let $x_1,x_2\in {\mathbb R}^d$ , then
Hence, $f_*$ is Lipschtiz with $\mathrm{Lip}(f_*)=\mathrm{Lip}(f)$ . Now, using Lemma 5.2 in [Reference Castorrini and Liverani5],
Therefore, $f_*\in {\mathcal C}^r$ .
B Appendix. Transversality
For the convenience of the reader, we state the transversality theorem as used in the main text. We refer to [Reference Arnold2] for details. The theorem in [[Reference Arnold2], Ch. 6, §29.E] is stated for smooth maps and manifolds, but it can easily be reduced to the following version by using the ${\mathcal C}^r$ version of Sard’s theorem. Also, the author discusses in detail the extension of the theorem to stratified sub-varieties, which is the case we are interested in and for which we state the theorem. One can also find the ${\mathcal C}^r$ version of the transversality theorem in [Reference Abraham1], but there the reader needs to be mindful of the specific properties they ask on manifolds.
Definition B.1. (Transversal mapping)
For every manifold $A,B$ and submanifold $C\subset B$ , a ${\mathcal C}^r$ mapping $f:A\to B$ is said to be transversal to $C (f \pitchfork C)$ at a point a if either $f(a)\notin C$ or the tangent plane to C at $f(a)$ , if $f(a)\in \partial C$ , and the image of the tangent plane to A at a are transversal: that is,
f is said to be transversal to C if f is transversal to C at a for every $a\in A$ .
Definition B.2. For every manifold $A,B$ and stratified sub-variety $C\subset B$ , a ${\mathcal C}^r$ mapping $f:A\to B$ is said to be transversal to C if it is transversal to C and all its substrata.
Theorem B.3. (Transversality theorem- ${\mathcal C}^r$ version [[Reference Arnold2], Ch. 6, §29.E])
Let A be a compact manifold and let C be a compact stratified sub-variety of a manifold B. Then the ${\mathcal C}^r$ mappings $f:A\to B$ with $f \pitchfork C$ form an open everywhere dense set in the space of all ${\mathcal C}^r$ mappings $A\to B$ .
We apply the above theorem to the following situation (Y and the functions $\phi _i$ are as defined in the §4.1).
Lemma B.4. Let $Y=\{x:\|x\|\leq R\}\subset {\mathbb R}^d$ , for some $R>0$ , let $B\subset Y$ be open and let $\phi _i\in {\mathcal C}^3({\mathbb R}^d,{\mathbb R}^d)$ , $i\in \{1,\ldots ,m\}$ be such that they are invertible when restricted to B. Also, let $W_i\subset B$ , $i\in \{1,\ldots , m\}$ , be $d_i$ -dimensional compact manifolds with boundaries. The maps $F:{\mathbb R}^d\to {\mathbb R}^{md}$ , defined by $F(x):=(\phi _{1}(x),\ldots , \phi _{m}(x))$ , which are transversal to the stratified sub-variety $C=W_1\times \cdots \times W_m$ form an open and dense set. Moreover, if $F|_{B} \pitchfork C$ , then the manifold $(\bigcup _{i=1}^m \phi _i^{-1} (W_i\setminus \partial W_i))\bigcap B$ is empty if there exists k such that $\sum _{i=1}^md_i-(k-1)d<0$ ; otherwise, it has dimension at most $\sum _{i=1}^md_i-(m-1)d$ .
Proof. Our strategy is to transform the present setting to the setting suitable for the application of Theorem B.3 and then proceed further.
The fact that $W_1\times \cdots \times W_k$ is a stratified sub-variety can be checked directly. A minor problem is that neither Y nor ${\mathbb R}^d$ are compact manifolds. To overcome this problem, we define the function $g\in {\mathcal C}^\infty ({\mathbb R},{\mathbb R})$ such that $g(y)=1$ for $y\leq 1$ and $g(y)=0$ for $y\geq a$ , with $|g'(x)|\leq 2a^{-1}$ ( $a>2$ to be chosen large enough), and we define $\hat F:{\mathbb R}^d\to {\mathbb R}^{md}$ as
Note that, for $x\in Y$ , $\hat F=F$ , whereas, for $\|x\|>aR$ , $\hat F(x)=0$ . Hence, $\hat F$ can be seen as a smooth function on the torus ${\mathbb R}^m/ {\mathbb Z}_{aR}$ (which indeed is a compact manifold). We can thus apply Theorem B.3 to obtain the first part of the Lemma.
To prove the second part, note that if $z\in B$ is such that $\hat F(y)=F(y)\in C\setminus \partial C$ , then
If there exists a $y\in {\mathbb R}^d$ such that $F(y)\in C\setminus \partial C$ , then $\phi _i(y)\in W_i$ , and hence ${y=\phi _i^{-1}(W_i)}$ : that is, $\bigcap _{i=1}^m \phi _i^{-1}(W_i)\supset \{y\}\neq \emptyset $ . Next, let $\bar d=\sum _{i=1}^m d_i$ . If $d+\bar d< md$ , then (B.1) cannot be satisfied. It follows that if $\bar d<(m-1)d$ , then $\bigcap _{i=1}^m\phi _i^{-1}(W_i)= \emptyset $ .
If $\bar d\geq (m-1)d$ , we study equation (B.1): for an arbitrary $(\tilde \beta _1,\ldots , \tilde \beta _m)\in {\mathbb R}^{md}$ , there must exist $\alpha \in {\mathbb R}^d$ and $\tilde w_i\in TW_i$ such that $D\phi _i\alpha +\tilde w_i=\tilde \beta _i$ . So, setting $\beta _i=D\phi _i^{-1}(\tilde \beta _i)$ and $w_i=D\phi _i^{-1}( \tilde w_i)$ , we must study the solutions of
Note that $\alpha $ is uniquely determined by $\alpha =\beta _1-w_1$ . Subtracting the second of the (B.2) from the first yields $w_1-w_2=\beta _1-\beta _2$ . If $d_1+d_2<d$ , such an equation has no solution for all $\beta _2$ , so the intersection must be empty. If $s_2=d_1+d_2-d\geq 0$ , then the dimension of $W_{1,2}:=D\phi _i^{-1}W_1\cap D\phi _i^{-1}W_2$ is $s_2$ . We can then write $w_1=\xi _1+\hat w_1$ and $w_2=\xi _1+\hat w_2$ with $\hat w_i\in D\phi _i^{-1}W_i\cap W_{1,2}^\perp $ . It follows that $\hat w_1-\hat w_2=\beta _1-\beta _2$ , which determines uniquely $\hat w_1,\hat w_2$ . We can then write $w_3-\xi _1=\beta _3-\beta _2+\hat w_2$ . Let $s_3=d_3+s_2-d=d_1+d_2+d_3-2d$ . If $s_3<0$ , again, in general, there are no solutions. Otherwise, the dimension of $W_{1,2,3}=W_{1,2}\cap D\phi _i^{-1}W_3$ is $s_3$ and we can write $\xi _1=\xi _2+\hat \xi _1$ , ${w_3=\xi _2+\hat w_3}$ with $\xi _2\in W_{1,2,3}$ and $\hat \xi \in W_{1,2}\cap W_{1,2,3}^\perp $ , $\hat w_3= D\phi _i^{-1}W_3\cap W_{1,2,3}^\perp $ . Accordingly, we have $\hat w_3-\hat \xi _1=\beta _3-\beta _2+\hat w_2$ , which determines uniquely $\hat w_3,\hat \xi _1$ . Continuing in such a way, we have that $W_{1,\ldots ,m}=D\phi _i^{-1}W_1\cap \cdots \cap D\phi _i^{-1}W_m$ has dimension $\bar d-(m-1) d$ . The case in which $F(y)$ belongs to a substrata of C is treated in exactly the same way and yields a lower dimension.
C Appendix. Technical lemmata
In this section, for the convenience of the reader, we collect some simple but boring technical results.
Lemma C.1. There exists $C_{\#}>0$ such that, for given $\varepsilon \in (0,1/2)$ and $f,h\in {\mathcal C}^3$ , where h is a diffeomorphism such that $\|h-\mathrm {id}\|_{{\mathcal C}^2}<\varepsilon $ , we have $h\circ f\circ h^{-1}\in {\mathcal C}^3$ and
Proof. First, we claim that $\|h-\mathrm {id}\|_{{\mathcal C}^2}<\varepsilon $ implies that $\|h^{-1}-\mathrm {id}\|_{{\mathcal C}^1}<2\varepsilon $ . Indeed, there exists a transformation A, with $\|A\|_{{\mathcal C}^1}<1$ , such that we can write . Therefore,
That is,
Moreover, the inverse function theorem implies that $h^{-1}\in {\mathcal C}^3$ . Accordingly, since ${Dh^{-1}=(Dh)^{-1}\circ h^{-1}}$ ,
Hence,
Next,
Thus,
for some constant $C_{\#}>0$ . Finally, $h\circ f\circ h^{-1}\in {\mathcal C}^3$ is the composition of ${\mathcal C}^3$ functions.
Lemma C.2. For given $\varepsilon>0$ , let $g, h\in {\mathcal C}^3$ be such that g is invertible and ${\|g-h\|_{{\mathcal C}^2}<\varepsilon }$ . Then $g^{-1}\circ h\in {\mathcal C}^3$ and
Additionally for $f\in {\mathcal C}^3$ , $f\circ g^{-1}\circ h, (f\circ g^{-1}\circ h)^{-1}\in {\mathcal C}^3$ and
Proof. Since $g\in {\mathcal C}^3$ is invertible, by the inverse function theorem, $g^{-1}\in {\mathcal C}^3$ and therefore there is a composition of ${\mathcal C}^3$ functions $ g^{-1}\circ h\in {\mathcal C}^3$ . Next, let $\Psi =g^{-1}\circ h$ . Then
Accordingly, $\|\Psi -\mathrm {id}\|_{{\mathcal C}^2}\leq C_{\#}\varepsilon \|(Dg)^{-1}\|_{{\mathcal C}^3}$ . Moreover, the first part of Lemma C.1 implies that $\Psi $ is invertible and $\|\Psi ^{-1}-\mathrm {id}\|_{{\mathcal C}^2}\leq C_{\#}\varepsilon \|(Dg)^{-1}\|_{{\mathcal C}^3}$ . This implies that $f\circ \Psi , \Psi ^{-1}\circ f^{-1}\in {\mathcal C}^3$ and
from which the Lemma follows.
Let f be a piecewise smooth contraction with an associated IFS $\Phi =\{\phi _1,\ldots ,\phi _m\}$ , as in equation (4.11), let $\Sigma ^m_{n,i}(\Phi )$ be the set of i-admissible sequences and, as in equation (4.15), let $D^{N}_\delta (\Phi )$ be a $\delta -$ neighbourhood of the boundary of partition ${\boldsymbol P}(f^{N})$ for $\delta>0$ . We have the following result.
Lemma C.3. For a piecewise smooth contraction f with IFS $\Phi =\{\phi _1,\ldots ,\phi _m\}$ and $N\in {\mathbb N}$ , there exists $\varepsilon>0$ such that, for $\tilde f$ with associated IFS $\tilde \Phi =\{\tilde \phi _1,\ldots ,\tilde \phi _m\}$ satisfying $d_2(f,\tilde f)<\varepsilon $ , we have $\Sigma ^m_{n,i}(\Phi )=\Sigma ^m_{n,i}(\tilde \Phi )$ . Moreover, there exists $\delta>0$ such that $D^{N}_{\delta /4}(\tilde \Phi )\subset D^{N}_{\delta /2}(\Phi )$ .
Proof. By hypothesis, there exists $M>0$ such that $\|\phi _i|_{U_i}^{-1}\|_{{\mathcal C}^1}\leq M$ and we can restrict to such a set by the definition of $D^{N}$ which entails only admissible sequences. Thus,
For $n\leq N$ and admissible sequence $\sigma =(\sigma _1,\sigma _2,\ldots ,\sigma _n)\in \Sigma ^m_n$ ,
Note that, by definition (4.11), there exists $\varepsilon _0>0$ such that, for each $\varepsilon $ -perturbation $\tilde \Phi $ of $\Phi $ , with $\varepsilon \in (0,\varepsilon _0)$ , $\Sigma ^m_{n,i}(\Phi )=\Sigma ^m_{n,i}(\tilde \Phi )$ . Moreover, for each $\sigma \in \Sigma ^m_{n,i}(\Phi )$ and ${\xi \in \psi _i^{-1}(M_{\sigma ,i}(\Phi ))}$ ,
Thus, for $\varepsilon $ small enough, $D^{N}(\tilde \Phi )\subset D^{N}_{\delta /4}(\Phi )$ , and hence $D^{N}_{\delta /4}(\tilde \Phi )\subset D^{N}_{\delta /2}(\Phi )$ .