Aperiodic languages p. 1/34. Aperiodic languages. Verimag, Grenoble

Similar documents
Algebraic Approach to Automata Theory

Duality and Automata Theory

Languages and monoids with disjunctive identity

Aperiodic languages and generalizations

F. Blanchet-Sadri and F.D. Gaddis, "On a Product of Finite Monoids." Semigroup Forum, Vol. 57, 1998, pp DOI: 10.

Obtaining the syntactic monoid via duality

L is finite or cofinite}, A + k+1 = { L A + L is a boolean combination of languages of the form L 1 L n (n 1) with L 1, L n A +

The bideterministic concatenation product

2. Syntactic Congruences and Monoids

Duality in Logic. Duality in Logic. Lecture 2. Mai Gehrke. Université Paris 7 and CNRS. {ε} A ((ab) (ba) ) (ab) + (ba) +

The wreath product principle for ordered semigroups

Languages, logics and automata

Semigroup invariants of symbolic dynamical systems

Profinite methods in automata theory

SYNTACTIC SEMIGROUP PROBLEM FOR THE SEMIGROUP REDUCTS OF AFFINE NEAR-SEMIRINGS OVER BRANDT SEMIGROUPS

Complementation of rational sets on countable scattered linear orderings

Algebra Meets Logic: The Case of Regular Languages (With Applications to Circuit Complexity) Denis Thérien, McGill University p.

Polynomial closure and unambiguous product

Overlapping tile automata:

Positive varieties and infinite words

Green s Relations and their Use in Automata Theory

1 Alphabets and Languages

Bridges for concatenation hierarchies

A robust class of regular languages

Definition of Büchi Automata

languages by semifilter-congruences

Tree languages defined in first-order logic with one quantifier alternation

Hierarchy among Automata on Linear Orderings

On closures of lexicographic star-free languages. E. Ochmański and K. Stawikowska

The theory of regular cost functions.

On the membership problem for pseudovarieties

Binary 3-compressible automata

NOTES ON AUTOMATA. Date: April 29,

ON THE STAR-HEIGHT OF SUBWORD COUNTING LANGUAGES AND THEIR RELATIONSHIP TO REES ZERO-MATRIX SEMIGROUPS

SEPARATING REGULAR LANGUAGES WITH FIRST-ORDER LOGIC

Lecture 2: Connecting the Three Models

Büchi Automata and their closure properties. - Ajith S and Ankit Kumar

A SURVEY ON DIFFERENCE HIERARCHIES OF REGULAR LANGUAGES

Closure Properties of Regular Languages. Union, Intersection, Difference, Concatenation, Kleene Closure, Reversal, Homomorphism, Inverse Homomorphism

Some results on the generalized star-height problem

STAR-FREE GEODESIC LANGUAGES FOR GROUPS

WHEN ARE REES CONGRUENCES PRINCIPAL?

Finite n-tape automata over possibly infinite alphabets: extending a Theorem of Eilenberg et al.

On Strong Alt-Induced Codes

On the Accepting Power of 2-Tape Büchi Automata

BENJAMIN LEVINE. 2. Principal Ideal Domains We will first investigate the properties of principal ideal domains and unique factorization domains.

From p-adic numbers to p-adic words

Automata Theory and Formal Grammars: Lecture 1

Sri vidya college of engineering and technology

Theory of Computation

The Pumping Lemma and Closure Properties

MATH 433 Applied Algebra Lecture 22: Semigroups. Rings.

Profinite methods in automata theory

LOGIC MEETS ALGEBRA: THE CASE OF REGULAR LANGUAGES

CPSC 421: Tutorial #1

Lecture Notes On THEORY OF COMPUTATION MODULE -1 UNIT - 2

On Recognizable Languages of Infinite Pictures

Semigroup presentations via boundaries in Cayley graphs 1

Aperiodic String Transducers

ON LOGICAL HIERARCHIES WITHIN FO 2 -DEFINABLE LANGUAGES

Subrings and Ideals 2.1 INTRODUCTION 2.2 SUBRING

Learning of Bi-ω Languages from Factors

GROUPS. Chapter-1 EXAMPLES 1.1. INTRODUCTION 1.2. BINARY OPERATION

Theme and variations on the concatenation product

Büchi Automata and Their Determinization

Finite Automata and Regular Languages

Notes on Monoids and Automata

On the use of guards for logics with data

Relational morphisms, transductions and operations on languages

Research Statement. MUHAMMAD INAM 1 of 5

Applied Automata Theory

CS 133 : Automata Theory and Computability

Unit 6. Non Regular Languages The Pumping Lemma. Reading: Sipser, chapter 1

Solutions I.N. Herstein- Second Edition

Pushdown Automata. We have seen examples of context-free languages that are not regular, and hence can not be recognized by finite automata.

(1) A frac = b : a, b A, b 0. We can define addition and multiplication of fractions as we normally would. a b + c d

A NOTE ON STAR-FREE EVENTS. Albert R. Meyer. Carnegie-Mellon University Pittsburg _, Pennsylvania February, ]968

Probabilistic Aspects of Computer Science: Probabilistic Automata

One-relation languages and ω-code generators

LTL is Closed Under Topological Closure

On the coinductive nature of centralizers

CERNY CONJECTURE FOR DFA ACCEPTING STAR-FREE LANGUAGES

CSci 311, Models of Computation Chapter 4 Properties of Regular Languages

Automata on linear orderings

Stone Duality. An application in the theory of formal languages. A Master thesis by Mirte Dekkers Under supervision of Prof. dr.

Definitions, Theorems and Exercises. Abstract Algebra Math 332. Ethan D. Bloch

Membership and Finiteness Problems for Rational Sets of Regular Languages

Languages. A language is a set of strings. String: A sequence of letters. Examples: cat, dog, house, Defined over an alphabet:

Theory of computation: initial remarks (Chapter 11)

Valence automata over E-unitary inverse semigroups

Limited Codes Associated with Petri Nets

Solutions to Some Review Problems for Exam 3. by properties of determinants and exponents. Therefore, ϕ is a group homomorphism.

CSE 105 Homework 1 Due: Monday October 9, Instructions. should be on each page of the submission.

Mergible States in Large NFA

On Recognizable Languages of Infinite Pictures

UNIT-III REGULAR LANGUAGES

Joseph Muscat Universal Algebras. 1 March 2013

HKN CS/ECE 374 Midterm 1 Review. Nathan Bleier and Mahir Morshed

Algebras with finite descriptions

Varieties Generated by Certain Models of Reversible Finite Automata

Transcription:

Aperiodic languages p. 1/34 Aperiodic languages Dejan Ničković Verimag, Grenoble

Aperiodic languages p. 2/34 Table of Contents Introduction Aperiodic Sets Star-Free Regular Sets Schützenberger s theorem SF AP Semigroups and Monoids Morphisms Recognizability Transition Monoid Syntactic Monoid Ideals Green Relations AP SF Example

Aperiodic languages p. 3/34 Introduction First (and most) significant subclass of regular languages. For a language L A, the following conditions are equivalent: L is first-order definable (in the signature <) L is definable in linear temporal logic (past) L is star-free regular L is recognized by a counter-free automaton Syntactic monoid of L is aperiodic Note: Similar results hold for ω-regular star-free languages There is an algorithm to decide whether a given recognizable language is star-free

Aperiodic languages p. 4/34 Aperiodic Sets A set X A is said to be aperiodic if there exists an integer n > 0 such that for all u, x, v A one has ux n v L ux n+1 v L An automaton is counter-free if there is no word w that permutes a non-trivial subset of Q

Aperiodic languages p. 5/34 Aperiodicity - Intuition a a a a a a Counter modulo 3 Counter saturating at 2 x n+p 1 x n x 1 x 2 x 3 x n 1 x n+1 x n+2 x 1 x 2 x 3 x n 1 x n Periodic semigroup generated by x with period p (x n = x n+p ) Aperiodic semigroup generated by x with index n (x n = x n+1 )

Aperiodic languages p. 6/34 Star-Free Regular Sets The set of regular subsets of A is the smallest set of subsets of A closed under finite union, concatenation and Kleen star. The set of star-free subsets of A is the smallest set of subsets of A containing the finite sets and closed under all finite boolean operations and concatenation. Extended regular expressions without the Kleene star operator Examples: A = (ab) = ɛ + (aa A b) (A aaa A bba ) (ab ba) = (A a)b(ab) (aa ) + (A b)(ab) a(ba )

Aperiodic languages p. 7/34 Star-free and Aperiodic Sets A rational set X A is star-free iff it is aperiodic Two proofs: Schützenberger (based on structure of monoids) SF AP : easy AP SF : much more difficult Use Krohn-Rhodes Decomposition

Aperiodic languages p. 8/34 Schützenberger s Theorem - SF AP Let i(x) be the smallest index that saturates aperiodic set X A i(a) = 2 for a A (trivial) i(x Y ) min(i(x), i(y )) (trivial) i(x) = i(x) (trivial) i(xy ) i(x) + i(y ) Proof Let N = m + n + 1 st N i(x) + i(y ) Let ux m r X and sx n v Y such that x = rs. Then ux N v XY. Either m i(x) or n i(y ). In the first case ux m+1 r X. Hence ux m+1 rsx n v = ux m+n+2 v = ux N+1 v XY. Second case and the other direction of the proof are similar.

Aperiodic languages p. 9/34 Semigroups and Monoids A semigroup is a set equipped with an associative operation. A monoid is a semigroup equipped with an identity (S,, 1) ab S for a, b S (ab)c = a(bc) for all a, b, c S 1a = a1 = 1 for all a S Example: A is the set of all words over an alphabet A and is called the free monoid with the null word ɛ and the concatenation of words being the monoid product. (N, +, 0) (N,, 1) Note: We call zero of S an element denoted by 0 such that 0s = s0 = 0 for all s S.

Aperiodic languages p. 10/34 Morphisms A morphism between two algebraic structures is a map that preserves the operations and the relations of the structure (applies to semigroups and monoids). Given two semigroups (S, ) and (T, ), a semigroup morphism ϕ : S T is a map from S into T st for all x, y S, ϕ(x y) = ϕ(x) ϕ(y) (for monoid morphisms we also need the condition ϕ(1 S ) = 1 T ). T is said to be homomorphic to S and we denote it by T ϕ S. Two mutually homomorphic semigroups are said to be isomorphic We are mainly interested in the homomorphisms from the infinite free monoid A onto finite monoids.

Aperiodic languages p. 11/34 Recognizable Languages Recognizable languages usually defined in terms of automata Let A = (Q, A, E, I, F) be a finite automaton. The language recognized by A is the set (denoted by L (A)) of the labels of all paths having its origin in I and its and in F. A language L is recognizabe if there exists a finite automaton A st L = L (A) Recognizible languages coincide with the class of regular languages (Kleene) Handling the fine structure of recognizable languages is more appropriate using a more abstract definition, using monoids (or semigroups) A monoid morphism ϕ : A M recognizes a language L A if L = ϕ 1 ϕ(l) If u L and ϕ(u) = ϕ(v) implies v L There is P M such that L = ϕ 1 (P)

Aperiodic languages p. 12/34 Transition Monoid Equivalence between automata and monoids Let A = (Q, A, E, I, F). To each word w A, there corresponds a relation on Q, denoted by ϕ(w) and defined by (p, q) ϕ(w) if there exists a path from p to q with label w. Monoid morphism: ϕ : A M(A) Monoid morphism from A into the semigroup of relations on Q Let R1, R 2 be two relations on Q. Then R 1 R 2 defined by (p, q) R 1 R 2 iff there exists r Q st (p, r) R 1 and (r, q) R 2. ϕ(a ) is called transition monoid (or transformation monoid) of A and is denoted by M(A). A word w is recognized by A iff (p, q) ϕ(w) for some initial state p and some final state q. If a finite automaton recognizes a language L, then its transition monoid recognizes L.

Aperiodic languages p. 13/34 Syntactic Monoid Let L A. The syntactic congruence L is defined over A by u L v iff for all x, t A, xuy L xvy L Syntactic monoid of L is the quotient monoid M = A / L. Each word in A is mapped to the congruence class modulo L of which that word is a member (many-to-one mapping). It is the smallest monoid recognizing L. Syntactic monoid recognizing L is the transition monoid of the minimal deterministic automaton recognizing L.

Aperiodic languages p. 14/34 Syntactic Monoid - Example a a 1 2 1 2 b a A = {a, b} L 1 = (ab), M 1 = ϕ(l 1 ) L 2 = (aa), M 2 = ϕ(l 2 ) 1 α β 0 αβ βα 1 1 2 / / 1 / 2 2 / 1 / / 2 1 α 0 1 1 2 / 2 2 1 / M 1 = {1, α, β, 0, αβ, βα} M 2 = {1, α, β} αβα = α, βαβ = β, α 2 = β 2 = 0 α 2 = 1, α 3 = α, β = 0 x, x 2 = x 3 n, α n α n+1 L 1 aperiodic L 2 not aperiodic

Aperiodic languages p. 15/34 Syntactic Monoid - Monoid Graph ϕ((ab) ) 1 α β 0 αβ βα 1 1 α β 0 αβ βα α α 0 αβ 0 0 α β β βα 0 0 β 0 0 0 0 0 0 0 0 αβ αβ α 0 0 αβ 0 βα βα 0 β 0 0 βα a α b αβ a a b 1 0 a,b b β a b βα a b Example: ϕ(a) = ϕ(aba) = ϕ(ababa) = α α is the class of words which need a b in order to be accepted.

Aperiodic languages p. 16/34 Sketch of Proof AP SF Show that each subset of an aperiodic monoid can be rewritten as a bollean compination of left, right and full ideals that it generates. Write the inverse morphism of the aperiodic monoid as a boolean combination of sets of prefix and suffix patterns of the words that are part of the language recognized by that monoid, as well as of the patterns not allowed in the middle of the accepted words. Prove by induction on the cardinality of the J -classes of the aperiodic monoid that the sets generated by the inverse morphism are star-free.

Aperiodic languages p. 17/34 Ideals Let S be a semigroup and M = S {1} a monoid. Right ideal of S is R S such that RM = R Left ideal of S is L S such that ML = L Ideal of S is I S such that MIM = I Example: Let P be the set of pair positive integers. P is an ideal of Z. A principal ideal I in a monoid M if it is generated by a single element m M. Example: Let P = 2Z. P is a principal ideal of Z.

Aperiodic languages p. 18/34 Ideals - Example βα α α αβ α β αβ α αβ αβ β βα 1 αβ β βα α β 0 βα βα β α αβ βα 0 Right Ideals of the Syntactic Monoid of (ab) βα α α αβ α β αβ α αβ αβ β βα 1 αβ β βα α β 0 βα βα β α αβ βα 0 Left Ideals of the Syntactic Monoid of (ab)

Aperiodic languages p. 19/34 Green Relations Let M be a monoid. We define 4 equivalence relations over M (L, R, J and H), called Green relations: a R b am = bm a L b Ma = Mb a J b MaM = MbM a H b a R b a L b adb iff c M st alc and crb So a R b, if there exist c, d M such that ac = b and bd = a. a and b are R-related if they generate the same right (principal) ideal. J D L R H Given that M is a finite monoid and m, n M, m D n iff m J n.

Aperiodic languages p. 20/34 Green Relations - Example 1 α β 0 αβ βα 1 1 α β 0 αβ βα α α 0 αβ 0 0 α β β βα 0 0 β 0 0 0 0 0 0 0 0 αβ αβ α 0 0 αβ 0 βα βα 0 β 0 0 βα 1M = M M1 = M 0M = 0 M0 = 0 αm = {α, αβ, 0} Mα = {α, βα, 0} βm = {β, βα, 0} Mβ = {β, αβ, 0} αβm = αm (αβ R α) Mαβ = Mβ (αβ L β) βαm = βm (βα R β) Mβα = Mα (βα L α)

Aperiodic languages p. 21/34 Green Relations - Example L-classes * 1 J -classes * αβ α R-classes β * βα * 0

Aperiodic languages p. 22/34 Schützenberger s Theorem - Lemma 1 Lemma 1 Let M be an aperiodic monoid and let p, q, r M. If pqr = q, then pq = q = qr. Simplification rule (SR) Proof Let pqr = q. Then p n qr n = q (let n be the index of M) p n+1 qr n = p n qr n = q (M aperiodic) q = p(p n qr n ) = pq Similar for q = qr Example: αβ α βα = α αβ α = α βα = α

Aperiodic languages p. 23/34 Schützenberger s Theorem - Lemma 2 Lemma 2 Let M be an aperiodic monoid and let p, q, r M. Then {q} = (qm Mq)\J q, with J q = {s M q MsM}. Intuition: decompose each subset of M into a boolean combination of right ideals, left ideals and ideals. Proof 1. q (qm Mq)\J q (trivial) 2. Let s (qm Mq)\J q 3. p, r st s = pq = qr (by (2)) 4. s J q. Hence q MsM (by definition of J q ) 5. u, v st q = usv (by (4)) 6. Replace s by pq (by (3)). Hence q = upqr = (up)q(r). 7. q = (up)q (SR). Replace pq by s. q = us. 8. Replace s by qr. Then q = uqr. Using SR q = qr. 9. Since s = qr, we can conclude that q = s.

Aperiodic languages p. 24/34 Schützenberger s Theorem - Lemma 2 Example: αm = {α, αβ, 0}, Mα = {α, βα, 0} Jα = {s M α MsM} αm Mα = {α, 0} MαM = {α, β, αβ, βα, 0} M0M = {0} Jα = {0} (αm Mα)\Jα = α

Aperiodic languages p. 25/34 Schützenberger s Theorem - Proof objective We want to prove that if ϕ : A M is a morphism from A into an aperiodic monoid M, the set ϕ 1 (P) is star-free for every subset P of M ϕ 1 (P) = [ m P ϕ 1 (m) Example: Let L1 = (ab) and M 1 = ϕ(l 1 ) = {1, α, β, 0, αβ, βα}. P = {1, αβ} (elements that lead from initial to final state). Then L 1 = ϕ 1 (1) + ϕ 1 (αβ). We need to show that ϕ 1 (1) and ϕ 1 (αβ) are star-free sets. We should show that ϕ 1 (m) is star-free by induction on the integer r(m) = Card(M\MmM).

Let L = (UA A V )\(A CA A WA ) Aperiodic languages p. 26/34 Schützenberger s Theorem - Inverse morphism We need to show the correctness of the formula ϕ 1 (m) = L where U = S (n,a) E ϕ 1 (n)a C = {a A m Mϕ(a)M} V = S (a,n) F aϕ 1 (n) W = S (a,n,b) G aϕ 1 (n)b E = {(n, a) M A nϕ(a) R m but n mm} F = {(a, n) A M ϕ(a)n L m but n Mm} G = {(a, n, b) A M A m (Mϕ(a)nM Mnϕ(b)M)\Mϕ(a)nϕ(b)M} We need to show that the sets U, V, C and W are star-free with

Aperiodic languages p. 27/34 Schützenberger s Theorem - ϕ 1 (m) L U = S(n,a) E ϕ 1 (n)a E = {(n, a) M A nϕ(a) R m but n mm} Let u ϕ 1 (m) and p the shortest prefix of u st ϕ(p) R m. p cannot be empty, otherwise 1 R m, and in that case m = 1 (by SR). Let p = ra with r A and a A. Let n = ϕ(r). n mm and nϕ(a) R m, hence (n, a) E. By construction u ϕ 1 (n)aa. Therefore, u UA. Symmetric argument in order to show that u A V.

Aperiodic languages p. 28/34 Schützenberger s Theorem - ϕ 1 (m) L C = {a A m Mϕ(a)M} Suppose that u A CA. Then, a st m = ϕ(u) Mϕ(a)M which is a contradiction with the definition of C W = S(a,n,b) G aϕ 1 (n)b G = {(a, n, b) A M A m (Mϕ(a)nM Mnϕ(b)M)\Mϕ(a)nϕ(b)M} Suppose that u A WA. Then exists (a, n, b) G st m Mϕ(a)nϕ(b)M which is a contradiction with the definition of G This concludes our proof that ϕ 1 (m) L

Aperiodic languages p. 29/34 Schützenberger s Theorem - L ϕ 1 (m) Let u L and s = ϕ(u). {m} = (mm Mm)\Jm, with J m = {s M m MsM} Since u UA A V (by def of L), we have s mm Mm. To prove that s = m and hence u ϕ 1 (m), it is enough to prove that s J m, ie. m MsM (by Lemma 2) Suppose m MsM. Let the word f be the minimal factor of u st m Mϕ(f)M f non-empty Case 1: f is a letter. Then this letter is in C and u A CA. Impossible. Case 2: Let f = agb with a, b A. Let n = ϕ(g). f is of minimal legth, hence m Mϕ(a)nM and m Mnϕ(b)M. In that case we have n G and f W. Impossible. We can conclude that s = m. This concludes our proof that L ϕ 1 (m). ϕ 1 (m) = L

Aperiodic languages p. 30/34 Schützenberger s Theorem - Inductive Base Let B = {a A ϕ(a) = 1} B is the set of letters which are self-loops on all the states in the automaton Base: r(m) = 0 Then m = 1, and we can show that ϕ 1 (1) = B and B is a star-free set. r(m) = 0 then M = MmM. Hence, there exists u, v M st umv = (um)1(v) = (u)1(mv) = 1. Apply simplification rule: (u)1 = 1, 1(v) = 1, hence u = v = 1 and consequently m = 1. B = A \ S a A\B A aa

Aperiodic languages p. 31/34 Schützenberger s Theorem - Inductive Step 1. Let r(m) > 0 2. Assume that we have established that ϕ 1 (s) is star-free for each element s with r(s) < r(m) 3. We need to show that UA, A V, A CA and A WA are star-free. 4. U = S (n,a) E ϕ 1 (n)aa 5. E = {(n, a) M A nϕ(a) R m but n mm} 6. Let (n, a) E. Given that nϕ(a) R m we have MmM MnM and hence r(n) r(m) 7. Suppose r(n) = r(m). Then n MmM 8. u, v M st n = umv (by (7)) 9. Since, m nϕ(a)m (by (6)), p M st m = np 10. n = unpv ((8) and substitution by (9)) 11. n = npv = mv (by SL and substitution by (9)) 12. n = mv is a contradiciton with n mm 13. Hence, r(n) < r(m) 14. Symmetric argument to prove that A V is star-free.

Aperiodic languages p. 32/34 Schützenberger s theorem - Inductive Step 1. W = S (a,n,b) G A aϕ 1 (n)ba 2. G = {(a, n, b) A M A m (Mϕ(a)nM Mnϕ(b)M)\Mϕ(a)nϕ(b)M} 3. Let (a, n, b) G. Since m MnM, r(n) r(m). 4. Assume r(n) = r(m). Then MmM = MnM 5. n MmM (by (4)) 6. Since m Mϕ(a)nM and m Mnϕ(b)M (by (3)), then also n Mϕ(a)nM and n Mnϕ(b)M 7. u, v M st n = unϕ(b)v (by (6)) 8. n = nϕ(b)v (by SR and (7)) 9. x, y M st m = xϕ(a)ny (by (6)) 10. m = xϕ(a)nϕ(b)vy (replace n in (9) using (8)) 11. m M ϕ(a)nϕ(b)m Contradiction with definition of G 12. We can conclude that r(n) < r(m)

Aperiodic languages p. 33/34 Schützenberger s theorem - Example Let us calculate ϕ 1 (αβ) UA : Find nϕ(a) R αβ such that n αβm. α R αβ. α = 1α = 1ϕ(a). 1 αβm. Hence UA = aa. A V : Find ϕ(a)n L αβ such that n Mαβ. β L αβ. β = β1 = ϕ(b)1. 1 Mαβ. Hence A V = A b. A WA : Let us first find k such that k MαβM. k = α 0 = α1α = ϕ(a)1ϕ(a). Remember that also α 2 = β 2 = β1β = ϕ(b)1ϕ(b). We verify that αβ MαM and αβ MβM. Hence A WA = A aaa A bba. ϕ 1 (αβ) = (aa A b)\(a aaa A bba )

Aperiodic languages p. 34/34 References G. Lallement, "Semigroups and Combinatorial Applications" O. Maler and A. Pnueli, "On the Cascaded Decomposition of Automata, its Complexity and its Application to Logic" R. McNaughton and S. Papert, "Counter-Free Automata" D. Perrin and J. E. Pin, "Infinite Words" D. Perrin, "Finite Automata", in Handbook of Theoretical Computer Science, Volume B J. E. Pin, "Finite Semigroups and Recognizable Languages: an Introduction" J. E. Pin, "Logic, Semigroups and Automata on Words" J. E. Pin, "Syntactic Semigroups" J. E. Pin, "Variétés de Languages Formels"