Wednesday, July 7, 2010

SUSE Linux misc

Some system tools such as zypper, yast and ldconfig cannot be accessed by ordinary users. You will only find those commands when you log in as root.

To stop the GUI and use the terminal on Ubuntu, run:

sudo /etc/init.d/gdm stop

Then press "Ctrl+Alt+F1" to switch to a text console.

On the SUSE server, to build the Theano package so that it works with Python, you have to compile Python itself as a shared library. That is, build Python with the following steps (xxxx stands for the install prefix):

./configure --enable-shared --prefix=xxxx

make

sudo make install

Otherwise, the build will probably fail with the error "libpython2.6.a: could not read symbols: Bad value".
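
A quick way to check whether a given Python was built with --enable-shared is to ask the interpreter itself. This is only a minimal sketch: the function name python_is_shared is my own, and it uses the sysconfig module from newer Python versions (on Python 2.6 the equivalent lives in distutils.sysconfig).

```python
import sysconfig


def python_is_shared():
    """Return True if this interpreter was configured with --enable-shared.

    python_is_shared is a hypothetical helper name, not part of any library.
    """
    # Py_ENABLE_SHARED is 1 for a shared build and 0 for a static one;
    # it can be None on platforms that do not define it.
    return bool(sysconfig.get_config_var("Py_ENABLE_SHARED"))


if __name__ == "__main__":
    print("shared build:", python_is_shared())
    # LDLIBRARY names the library the build links against (a .so or a .a file).
    print("library file:", sysconfig.get_config_var("LDLIBRARY"))
```

Run it with the Python you just installed under your prefix. If it reports a static build, the configure step above probably did not get --enable-shared.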

Posted via email from Troy's posterous

Saturday, July 3, 2010

Two more papers on speaker adaptation, 2010

New Speaker Adaptation Method Using 2-D PCA.pdf (276 KB)

Unsupervised Speaker Adaptation based on the Cosine Similarity for Text-Independent Speaker Verification.pdf (149 KB)

Two new speaker adaptation methods

Speaker adaptation using generalised low rank approximations of training matrices

http://scitation.aip.org/getabs/servlet/GetabsServlet?prog=normal&id=ELLEAK000046000010000724000001&idtype=cvips&gifs=yes&ref=no

Abstract:

A speaker adaptation method based on the low rank approximation of matrices (GLRAM) of training models is described. In the method, each model is represented as a matrix, and a set of such training matrices is decomposed into a set of speaker weights and two basis matrices for row and column spaces by reducing both row and column ranks of the training models. As a result, the speaker weight becomes a matrix, the row and column dimensions of which can be adjusted. In the isolated-word experiment, the proposed method showed better performance than both eigenvoice and MLLR for the adaptation data of about 20 s or longer.
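
To make the decomposition concrete for myself, here is a rough NumPy sketch of the generic GLRAM idea: each training matrix A_i is approximated as L M_i R^T, where L and R are shared row and column bases and M_i is the per-speaker weight matrix. It uses the usual alternating eigen-decomposition and is not necessarily the exact procedure of the paper. The function names, the identity-based initialisation and the random data standing in for speaker models are all my own assumptions.

```python
import numpy as np


def top_eigvecs(sym, k):
    """Return the k eigenvectors of a symmetric matrix with largest eigenvalues."""
    _, vecs = np.linalg.eigh(sym)      # eigh returns eigenvalues in ascending order
    return vecs[:, ::-1][:, :k]


def glram(mats, r, c, n_iter=20):
    """Approximate each A_i (m x n) as L @ M_i @ R.T.

    Returns L (m x r), R (n x c) and the weights M with shape (N, r, c).
    """
    A = np.asarray(mats, dtype=float)  # shape (N, m, n)
    n = A.shape[2]
    R = np.eye(n)[:, :c]               # simple initial guess (an assumption)
    for _ in range(n_iter):
        # Fix R, update L from sum_i (A_i R)(A_i R)^T
        AR = A @ R                                     # (N, m, c)
        L = top_eigvecs(np.einsum("imc,ikc->mk", AR, AR), r)
        # Fix L, update R from sum_i (A_i^T L)(A_i^T L)^T
        AtL = np.transpose(A, (0, 2, 1)) @ L           # (N, n, r)
        R = top_eigvecs(np.einsum("inr,ikr->nk", AtL, AtL), c)
    M = L.T @ A @ R                    # per-matrix weights, shape (N, r, c)
    return L, R, M


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    models = rng.standard_normal((10, 8, 6))   # random stand-ins for speaker models
    L, R, M = glram(models, r=3, c=2)
    approx = L @ M @ R.T
    print("weight shape per speaker:", M.shape[1:])
    print("relative error:", np.linalg.norm(models - approx) / np.linalg.norm(models))
```

The point of the abstract shows up in the shapes: the speaker weight M_i is an r x c matrix, and r and c can be tuned independently. Estimating M for a new speaker from adaptation data is a separate step that this sketch does not cover.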

Bilinear model for speaker adaptation using tensor analysis

http://scitation.aip.org/getabs/servlet/GetabsServlet?prog=normal&id=ELLEAK000046000003000243000001&idtype=cvips&gifs=yes&ref=no

Abstract:

A novel speaker adaptation method based on two-way analysis of training speakers is described. A set of training models is expressed as a tensor and is decomposed into two factors using nonlinear iterative partial least squares, producing a bilinear model. The resulting model has bases of lower dimension and more free parameters than those of eigenvoice, enabling more elaborate modelling for a moderate amount of adaptation data. Results from the isolated-word recognition test show that the proposed model outperforms both eigenvoice and maximum likelihood linear regression (MLLR) for adaptation data longer than 15 s. Moreover, the proposed method can straightforwardly be extended to n-way analysis, e.g. for simultaneous adaptation of speaker, environment, etc.
