### Abstract

This paper presents a new pattern discovery algorithm for constructing probabilistic finite state automata (PFSA) from symbolic sequences. The algorithm, called Compression via Recursive Identification of Self-Similar Semantics (CRISSiS), uses synchronizing strings of a PFSA to localize particular states and then recursively identifies the remaining states by computing the n-step derived frequencies. We compare our algorithm with existing algorithms such as D-Markov and Causal-State Splitting Reconstruction (CSSR), and show both theoretically and experimentally that it captures a larger class of models.
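As a rough illustration of the kind of statistic the abstract refers to, the sketch below estimates frequencies from a symbol sequence. It is not the CRISSiS algorithm itself. It assumes that an "n-step derived frequency" means the empirical distribution of the length-n blocks that follow a given word, which the abstract does not define. The function name `derived_frequencies` and its parameters are hypothetical.

```python
from collections import Counter


def derived_frequencies(sequence, word, n=1):
    """Empirical distribution of the length-n blocks that follow `word`.

    Assumption: this is one plausible reading of "n-step derived
    frequencies"; the abstract does not give the definition.
    """
    counts = Counter()
    w = len(word)
    # Slide over every position where `word` plus n more symbols still fit.
    for i in range(len(sequence) - w - n + 1):
        if sequence[i:i + w] == word:
            counts[sequence[i + w:i + w + n]] += 1
    total = sum(counts.values())
    # Return an empty dict when `word` never occurs with n symbols after it.
    return {block: c / total for block, c in counts.items()} if total else {}


if __name__ == "__main__":
    seq = "0010010010"  # toy symbolic sequence, made up for illustration
    print(derived_frequencies(seq, "1"))
    print(derived_frequencies(seq, "0"))
```

In this toy sequence, every occurrence of `1` is followed by `0`, while `0` is followed by `0` or `1` equally often. Comparing such distributions across words is one way a reconstruction procedure might decide whether two words lead to the same state.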

Title of host publication | Proceedings of the 2011 American Control Conference, ACC 2011 |

Pages | 125-130 |

Number of pages | 6 |

State | Published - 2011 |

Event | 2011 American Control Conference, ACC 2011 - San Francisco, CA, United States |

Duration | Jun 29 2011 → Jul 1 2011 |

### Other

Country | United States |

City | San Francisco, CA |

Period | 6/29/11 → 7/1/11 |

- Electrical and Electronic Engineering

