In this paper we study the number rbwt of equal-letter runs produced by the Burrows-Wheeler transform (BWT) when it is applied to purely morphic finite words, which are words generated by iterating prolongable morphisms. Such a parameter rbwt is very significant since it provides a measure of the performances of the BWT, in terms of both compressibility and indexing. In particular, we prove that, when BWT is applied to whichever purely morphic finite word on a binary alphabet, rbwt is O(log n), where n is the length of the word. Moreover, we prove that rbwt is Θ(log n) for the binary words generated by a large class of prolongable binary morphisms. These bounds are proved by providing some new structural properties of the bispecial circular factors of such words.

Frosini, A., Mancini, I., Rinaldi, S., Romana, G., Sciortino, M. (2022). Logarithmic Equal-Letter Runs for BWT of Purely Morphic Words. In Developments in Language Theory. DLT 2022. (pp.139-151). Cham : Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-05578-2_11].

Logarithmic Equal-Letter Runs for BWT of Purely Morphic Words

Rinaldi S.;
2022-01-01

Abstract

In this paper we study the number rbwt of equal-letter runs produced by the Burrows-Wheeler transform (BWT) when it is applied to purely morphic finite words, which are words generated by iterating prolongable morphisms. Such a parameter rbwt is very significant since it provides a measure of the performances of the BWT, in terms of both compressibility and indexing. In particular, we prove that, when BWT is applied to whichever purely morphic finite word on a binary alphabet, rbwt is O(log n), where n is the length of the word. Moreover, we prove that rbwt is Θ(log n) for the binary words generated by a large class of prolongable binary morphisms. These bounds are proved by providing some new structural properties of the bispecial circular factors of such words.
2022
978-3-031-05577-5
978-3-031-05578-2
Frosini, A., Mancini, I., Rinaldi, S., Romana, G., Sciortino, M. (2022). Logarithmic Equal-Letter Runs for BWT of Purely Morphic Words. In Developments in Language Theory. DLT 2022. (pp.139-151). Cham : Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-05578-2_11].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1253600