Title:
|
Unsupervised spectral learning of finite-state transducers
|
Author:
|
Bailly, Raphaël; Carreras Pérez, Xavier; Quattoni, Ariadna Julieta
|
Other authors:
|
Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics; Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural; Universitat Politècnica de Catalunya. LARCA - Laboratori d'Algorísmia Relacional, Complexitat i Aprenentatge |
Abstract:
|
Finite-State Transducers (FST) are a standard tool for modeling paired inputoutput
sequences and are used in numerous applications, ranging from computational
biology to natural language processing. Recently Balle et al. presented a spectral algorithm for learning FST from samples of aligned input-output sequences. In this paper we address the more realistic, yet challenging setting where the alignments are unknown to the learning algorithm. We frame FST learning as finding a low rank Hankel matrix satisfying constraints derived from observable statistics. Under this formulation, we provide identifiability results for FST distributions. Then, following previous work on rank minimization, we propose a regularized convex relaxation of this objective which is based on minimizing a nuclear norm penalty subject to linear constraints and can be solved efficiently. |
Abstract:
|
Peer Reviewed |
Subject(s):
|
-Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Llenguatge natural -Finite-state transducers -Finite State Transducers
Spectral Learning -Transductors d'estats finits |
Rights:
|
|
Document type:
|
Article - Submitted version Conference Object |
Share:
|
|