Regular expression containment: coinductive axiomatization and computational interpretation

Fritz Henglein; Lasse Nielsen

doi:10.1145/1925844.1926429

Regular expression containment: coinductive axiomatization and computational interpretation

16 Citationer (Scopus)

Abstract

We present a new sound and complete axiomatization of regular
expression containment. It consists of the conventional axiomatization
of concatenation, alternation, empty set and (the singleton set
containing) the empty string as an idempotent semiring, the fixedpoint
rule E = 1 + E E for Kleene-star, and a general coinduction
rule as the only additional rule.
Our axiomatization gives rise to a natural computational interpretation
of regular expressions as simple types that represent parse
trees, and of containment proofs as coercions. This gives the axiomatization
a Curry-Howard-style constructive interpretation: Containment
proofs do not only certify a language-theoretic containment,
but, under our computational interpretation, constructively
transform a membership proof of a string in one regular expression
into a membership proof of the same string in another regular
expression.
We show how to encode regular expression equivalence proofs
in Salomaa’s, Kozen’s and Grabmayer’s axiomatizations into our
containment system, which equips their axiomatizations with a
computational interpretation and implies completeness of our axiomatization.
To ensure its soundness, we require that the computational
interpretation of the coinduction rule be a hereditarily total
function. Hereditary totality can be considered the mother of syntactic
side conditions: it “explains” their soundness, yet cannot be
used as a conventional side condition in its own right since it turns
out to be undecidable.
We discuss application of regular expressions as types to bit
coding of strings and hint at other applications to the wide-spread
use of regular expressions for substring matching, where classical
automata-theoretic techniques are a priori inapplicable.
Neither regular expressions as types nor subtyping interpreted
coercively are novel per se. Somewhat surprisingly, this seems to
be the first investigation of a general proof-theoretic framework for
the latter in the context of the former, however.

Originalsprog	Engelsk
Tidsskrift	A C M / S I G P L A N Notices
Vol/bind	46
Udgave nummer	1
Sider (fra-til)	385-398
Antal sider	14
ISSN	1523-2867
DOI	https://doi.org/10.1145/1925844.1926429
Status	Udgivet - jan. 2011
Begivenhed	38th Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages - Austin, USA Varighed: 26 jan. 2011 → 28 jan. 2011 Konferencens nummer: 38

Konference

Konference	38th Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages
Nummer	38
Land/Område	USA
By	Austin
Periode	26/01/2011 → 28/01/2011

Adgang til dokumentet

10.1145/1925844.1926429

Citationsformater

@inproceedings{4f428203ef784acda95ae6a872007995,

title = "Regular expression containment: coinductive axiomatization and computational interpretation",

abstract = "We present a new sound and complete axiomatization of regular expression containment. It consists of the conventional axiomatization of concatenation, alternation, empty set and (the singleton set containing) the empty string as an idempotent semiring, the fixedpoint rule E* = 1 + E×E* for Kleene-star, and a general coinduction rule as the only additional rule. Our axiomatization gives rise to a natural computational interpretation of regular expressions as simple types that represent parse trees, and of containment proofs as coercions. This gives the axiomatization a Curry-Howard-style constructive interpretation: Containment proofs do not only certify a language-theoretic containment, but, under our computational interpretation, constructively transform a membership proof of a string in one regular expression into a membership proof of the same string in another regular expression. We show how to encode regular expression equivalence proofs in Salomaa's, Kozen's and Grabmayer's axiomatizations into our containment system, which equips their axiomatizations with a computational interpretation and implies completeness of our axiomatization. To ensure its soundness, we require that the computational interpretation of the coinduction rule be a hereditarily total function. Hereditary totality can be considered the mother of syntactic side conditions: It {"}explains{"} their soundness, yet cannot be used as a conventional side condition in its own right since it turns out to be undecidable. We discuss application of regular expressions as types to bit coding of strings and hint at other applications to the wide-spread use of regular expressions for substring matching, where classical automata-theoretic techniques are a priori inapplicable. Neither regular expressions as types nor subtyping interpreted coercively are novel per se. Somewhat surprisingly, this seems to be the first investigation of a general proof-theoretic framework for the latter in the context of the former, however.",

author = "Fritz Henglein and Lasse Nielsen",

note = "Proc. 38th ACM SIGACT-SIGPLAN Symposium on Principles of Programming Languages (POPL); 38th Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, POPL 2011 ; Conference date: 26-01-2011 Through 28-01-2011",

year = "2011",

month = jan,

doi = "10.1145/1925844.1926429",

language = "English",

volume = "46",

pages = "385--398",

journal = "SIGPLAN Notices (ACM Special Interest Group on Programming Languages)",

issn = "1523-2867",

publisher = "Association for Computing Machinery / Special Interest Group on Programming Languages",

number = "1",

}

TY - GEN

T1 - Regular expression containment

T2 - 38th Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages

AU - Henglein, Fritz

AU - Nielsen, Lasse

N1 - Conference code: 38

PY - 2011/1

Y1 - 2011/1

N2 - We present a new sound and complete axiomatization of regular expression containment. It consists of the conventional axiomatization of concatenation, alternation, empty set and (the singleton set containing) the empty string as an idempotent semiring, the fixedpoint rule E* = 1 + E×E* for Kleene-star, and a general coinduction rule as the only additional rule. Our axiomatization gives rise to a natural computational interpretation of regular expressions as simple types that represent parse trees, and of containment proofs as coercions. This gives the axiomatization a Curry-Howard-style constructive interpretation: Containment proofs do not only certify a language-theoretic containment, but, under our computational interpretation, constructively transform a membership proof of a string in one regular expression into a membership proof of the same string in another regular expression. We show how to encode regular expression equivalence proofs in Salomaa's, Kozen's and Grabmayer's axiomatizations into our containment system, which equips their axiomatizations with a computational interpretation and implies completeness of our axiomatization. To ensure its soundness, we require that the computational interpretation of the coinduction rule be a hereditarily total function. Hereditary totality can be considered the mother of syntactic side conditions: It "explains" their soundness, yet cannot be used as a conventional side condition in its own right since it turns out to be undecidable. We discuss application of regular expressions as types to bit coding of strings and hint at other applications to the wide-spread use of regular expressions for substring matching, where classical automata-theoretic techniques are a priori inapplicable. Neither regular expressions as types nor subtyping interpreted coercively are novel per se. Somewhat surprisingly, this seems to be the first investigation of a general proof-theoretic framework for the latter in the context of the former, however.

AB - We present a new sound and complete axiomatization of regular expression containment. It consists of the conventional axiomatization of concatenation, alternation, empty set and (the singleton set containing) the empty string as an idempotent semiring, the fixedpoint rule E* = 1 + E×E* for Kleene-star, and a general coinduction rule as the only additional rule. Our axiomatization gives rise to a natural computational interpretation of regular expressions as simple types that represent parse trees, and of containment proofs as coercions. This gives the axiomatization a Curry-Howard-style constructive interpretation: Containment proofs do not only certify a language-theoretic containment, but, under our computational interpretation, constructively transform a membership proof of a string in one regular expression into a membership proof of the same string in another regular expression. We show how to encode regular expression equivalence proofs in Salomaa's, Kozen's and Grabmayer's axiomatizations into our containment system, which equips their axiomatizations with a computational interpretation and implies completeness of our axiomatization. To ensure its soundness, we require that the computational interpretation of the coinduction rule be a hereditarily total function. Hereditary totality can be considered the mother of syntactic side conditions: It "explains" their soundness, yet cannot be used as a conventional side condition in its own right since it turns out to be undecidable. We discuss application of regular expressions as types to bit coding of strings and hint at other applications to the wide-spread use of regular expressions for substring matching, where classical automata-theoretic techniques are a priori inapplicable. Neither regular expressions as types nor subtyping interpreted coercively are novel per se. Somewhat surprisingly, this seems to be the first investigation of a general proof-theoretic framework for the latter in the context of the former, however.

U2 - 10.1145/1925844.1926429

DO - 10.1145/1925844.1926429

M3 - Conference article

SN - 1523-2867

VL - 46

SP - 385

EP - 398

JO - SIGPLAN Notices (ACM Special Interest Group on Programming Languages)

JF - SIGPLAN Notices (ACM Special Interest Group on Programming Languages)

IS - 1

Y2 - 26 January 2011 through 28 January 2011

ER -

Regular expression containment: coinductive axiomatization and computational interpretation

Abstract

Konference

Adgang til dokumentet

Fingeraftryk

Citationsformater