Word Motifs and a Generalized Hamming Distance

Authors

  • Pengyu Liu Department of Mathematics and Applied Mathematical Sciences, University of Rhode Island, Kingston, USA https://orcid.org/0000-0001-7154-7546
  • Jingzhou Na Department of Mathematics, Simon Fraser University, Burnaby, Canada

DOI:

https://doi.org/10.37256/cm.6120256175

Keywords:

combinatorics on words, Hamming distance, word patterns, permutations

Abstract

Combinatorics on words is a relatively recent and rich field that involves formal grammar, algebra, geometry, fractals, algorithms, and coding, with initial research focused on repetitions in words. In this paper, we measure the differences between patterns shared by words of the same length. We introduce word motifs to represent collections of words that share the same underlying patterns, and we generalize the Hamming distance for comparing word motifs. A word motif is an equivalence class of words of the same length over an alphabet under the equivalence relation induced by symbol relabeling. We study initial problems in comparing word motifs. We compute the maximal generalized Hamming distance for k word motifs of length n over an alphabet of symbols, and we demonstrate how to calculate the exact generalized Hamming distance between a pair of word motifs.

Downloads

Published

2025-02-20

How to Cite

1.
Liu P, Na J. Word Motifs and a Generalized Hamming Distance. Contemp. Math. [Internet]. 2025 Feb. 20 [cited 2025 Feb. 23];6(1):1255-64. Available from: https://ojs.wiserpub.com/index.php/CM/article/view/6175