
|
GNUSpeech Console Utility
Hello, I am a new obj-c programmer. I decided to take a look at GNUSpeech because I am interested in speech synthesis and I've wanted to implement an existing open source speech synthesis implementation as an Apple Audio Unit synthesizer for some time. I like the sound of GNUSpeech compared to other synthesis packages I tested, and wanted to learn some of the more advanced oop programming techniqes used in its implimentation such as setting up and managing Events. I decided to start out by creating a simple "Hello World" example utilizing the GNUSpeechserver as a starting point. My goal is to create a package that is not reliant on an external speech daemon/server to synthesize text and has the synthesis callback components compiled within the audiounit. This way I can experiment with the synthesis parameters if I wanted to, ie testing different types of noise, square wave vs. sine wave, etc.
Because I am a bit of a beginner, please let me know if because of my level of experience this email is inappropriate. I'm having trouble with segmentation faults as the program accesses speechtophone methods.
Here is my very naive implementation: #import "TextToSpeech.h" int main (int argc, const char * argv[]) { NSAutoreleasePool * pool = [[NSAutoreleasePool alloc] init]; // insert code here...
NSLog(@"Hello, World!"); TextToSpeech * textToSpeech; NSLog(@"0"); [textToSpeech init]; NSString * stext = @"hello world text"; [textToSpeech speakText:stext];
[textToSpeech dealloc]; //[phoneString dealloc]; [pool drain]; return 0; } I'm sure there are many problems with the above implementation, and I hope they aren't too many to list in a response email. Any nudges in the correct direction would be very helpful. Please don't waste too much of your time on me.
Thank you, John Delaney
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
|

|
Re: GNUSpeech Console Utility
Hi John,
On Tue, Nov 3, 2009 at 9:38 PM, John Delaney < a.pet.rock@...> wrote:
> Because I am a bit of a beginner, please let me know if because of my level
> of experience this email is inappropriate.
I am a newbie in ObjC too.
> Here is my very naive implementation:
> #import "TextToSpeech.h"
>
> int main (int argc, const char * argv[]) {
> NSAutoreleasePool * pool = [[NSAutoreleasePool alloc] init];
>
> // insert code here...
> NSLog(@"Hello, World!");
>
> TextToSpeech * textToSpeech;
> NSLog(@"0");
> [textToSpeech init];
>
> NSString * stext = @"hello world text";
>
>
> [textToSpeech speakText:stext];
>
>
> [textToSpeech dealloc];
> //[phoneString dealloc];
>
> [pool drain];
> return 0;
> }
You need to allocate the memory first:
TextToSpeech *textToSpeech = [[TextToSpeech alloc] init];
And AFAIK you should use:
[textToSpeech release];
not dealloc.
Regards,
Marcelo
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
|

|
Re: GNUSpeech Console Utility
Thank you Marcelo, Wow the screen output is really cool! But I'm still not getting sound, and still getting a seg fault at the end. Is there a sound setting I'm missing? Here is the output at the end:
2009-11-03 16:21:11.555 tst_obc[1667:10b] Posture 0 tempo: 0.912, syllable: 0, onset: 0.00, ruleTempo: 1.000, # # Rule 30, duration: 163.63, beat: 113.54 2009-11-03 16:21:11.556 tst_obc[1667:10b] Posture 1 tempo: 0.912, syllable: 0, onset: 113.54, ruleTempo: 1.000, h
2009-11-03 16:21:11.556 tst_obc[1667:10b] Posture 2 tempo: 0.912, syllable: 1, onset: 113.54, ruleTempo: 1.000, e # Rule 42, duration: 86.69, beat: 180.34 2009-11-03 16:21:11.557 tst_obc[1667:10b] Foot 1 tempo: 0.912, marked: 0, last: 0, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.557 tst_obc[1667:10b] Posture 3 tempo: 0.912, syllable: 0, onset: 180.34, ruleTempo: 1.000, l # Rule 42, duration: 78.47, beat: 264.69 2009-11-03 16:21:11.558 tst_obc[1667:10b] Posture 4 tempo: 0.912, syllable: 0, onset: 264.69, ruleTempo: 1.000, uh # Rule 32, duration: 87.13, beat: 338.40
2009-11-03 16:21:11.558 tst_obc[1667:10b] Posture 5 tempo: 0.912, syllable: 0, onset: 338.40, ruleTempo: 1.000, uu # Rule 42, duration: 96.18, beat: 433.24 2009-11-03 16:21:11.559 tst_obc[1667:10b] Foot 2 tempo: 0.942, marked: 0, last: 0, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.559 tst_obc[1667:10b] Posture 6 tempo: 0.942, syllable: 0, onset: 433.24, ruleTempo: 1.000, w # Rule 42, duration: 108.69, beat: 531.74 2009-11-03 16:21:11.560 tst_obc[1667:10b] Posture 7 tempo: 0.942, syllable: 0, onset: 531.74, ruleTempo: 1.000, er # Rule 42, duration: 104.97, beat: 638.99
2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 8 tempo: 0.942, syllable: 0, onset: 638.99, ruleTempo: 1.000, r # Rule 42, duration: 86.40, beat: 739.28 2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 9 tempo: 0.942, syllable: 0, onset: 739.28, ruleTempo: 1.000, ll # Rule 47, duration: 89.26, beat: 846.21
2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 10 tempo: 0.942, syllable: 0, onset: 846.21, ruleTempo: 1.000, d # Rule 21, duration: 170.45, beat: 1016.62 2009-11-03 16:21:11.562 tst_obc[1667:10b] Foot 3 tempo: 0.991, marked: 1, last: 1, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.562 tst_obc[1667:10b] Posture 11 tempo: 0.991, syllable: 0, onset: 1016.62, ruleTempo: 1.000, t' 2009-11-03 16:21:11.563 tst_obc[1667:10b] Posture 12 tempo: 0.991, syllable: 0, onset: 1016.62, ruleTempo: 1.000, e' # Rule 24, duration: 227.35, beat: 1207.80
2009-11-03 16:21:11.563 tst_obc[1667:10b] Posture 13 tempo: 0.991, syllable: 0, onset: 1207.80, ruleTempo: 1.000, k' 2009-11-03 16:21:11.564 tst_obc[1667:10b] Posture 14 tempo: 0.991, syllable: 0, onset: 1207.80, ruleTempo: 1.000, s' # Rule 18, duration: 202.33, beat: 1445.70
2009-11-03 16:21:11.564 tst_obc[1667:10b] Posture 15 tempo: 0.991, syllable: 0, onset: 1445.70, ruleTempo: 1.000, t' 2009-11-03 16:21:11.565 tst_obc[1667:10b] Posture 16 tempo: 0.991, syllable: 0, onset: 1445.70, ruleTempo: 1.000, #'
2009-11-03 16:21:11.565 tst_obc[1667:10b] < printDataStructures: 2009-11-03 16:21:11.566 tst_obc[1667:10b] sample rate: 22050.000000 2009-11-03 16:21:11.567 tst_obc[1667:10b] format id: 6c70636d (lpcm) 2009-11-03 16:21:11.567 tst_obc[1667:10b] format flags: 1c
2009-11-03 16:21:11.568 tst_obc[1667:10b] bytes per packet: 2 2009-11-03 16:21:11.569 tst_obc[1667:10b] frames per packet: 1 2009-11-03 16:21:11.569 tst_obc[1667:10b] bytes per frame: 2 2009-11-03 16:21:11.570 tst_obc[1667:10b] channels per frame: 1
2009-11-03 16:21:11.570 tst_obc[1667:10b] bits per channel: 16 initializeConversion(), sampleRateConverter.maximumSampleValue: 0 2009-11-03 16:21:11.645 tst_obc[1667:10b] number of samples: 32928 2009-11-03 16:21:11.646 tst_obc[1667:10b] maximum sample value: 0.0011
2009-11-03 16:21:11.646 tst_obc[1667:10b] scale: 29971423.7842 2009-11-03 16:21:11.652 tst_obc[1667:10b] dealloc, free(inputData) Segmentation fault logout Thank you, John Delaney
On Tue, Nov 3, 2009 at 4:13 PM, Marcelo Yassunori Matuda <marcelo.matuda@...> wrote:
Hi John,
On Tue, Nov 3, 2009 at 9:38 PM, John Delaney < a.pet.rock@...> wrote:
> Because I am a bit of a beginner, please let me know if because of my level
> of experience this email is inappropriate.
I am a newbie in ObjC too.
> Here is my very naive implementation:
> #import "TextToSpeech.h"
>
> int main (int argc, const char * argv[]) {
> NSAutoreleasePool * pool = [[NSAutoreleasePool alloc] init];
>
> // insert code here...
> NSLog(@"Hello, World!");
>
> TextToSpeech * textToSpeech;
> NSLog(@"0");
> [textToSpeech init];
>
> NSString * stext = @"hello world text";
>
>
> [textToSpeech speakText:stext];
>
>
> [textToSpeech dealloc];
> //[phoneString dealloc];
>
> [pool drain];
> return 0;
> }
You need to allocate the memory first:
TextToSpeech *textToSpeech = [[TextToSpeech alloc] init];
And AFAIK you should use:
[textToSpeech release];
not dealloc.
Regards,
Marcelo
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
|

|
Re: GNUSpeech Console Utility
Ok, by not releasing the textToSpeech object, the seg fault went away. Now it makes sound, however, it only makes a chirp, a very small piece of the overall sound. I think I still must be missing something. Thank you,
John Delaney On Tue, Nov 3, 2009 at 4:25 PM, John Delaney <a.pet.rock@...> wrote:
Thank you Marcelo,
Wow the screen output is really cool! But I'm still not getting sound, and still getting a seg fault at the end. Is there a sound setting I'm missing?
Here is the output at the end:
2009-11-03 16:21:11.555 tst_obc[1667:10b] Posture 0 tempo: 0.912, syllable: 0, onset: 0.00, ruleTempo: 1.000, # # Rule 30, duration: 163.63, beat: 113.54 2009-11-03 16:21:11.556 tst_obc[1667:10b] Posture 1 tempo: 0.912, syllable: 0, onset: 113.54, ruleTempo: 1.000, h
2009-11-03 16:21:11.556 tst_obc[1667:10b] Posture 2 tempo: 0.912, syllable: 1, onset: 113.54, ruleTempo: 1.000, e # Rule 42, duration: 86.69, beat: 180.34 2009-11-03 16:21:11.557 tst_obc[1667:10b] Foot 1 tempo: 0.912, marked: 0, last: 0, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.557 tst_obc[1667:10b] Posture 3 tempo: 0.912, syllable: 0, onset: 180.34, ruleTempo: 1.000, l # Rule 42, duration: 78.47, beat: 264.69 2009-11-03 16:21:11.558 tst_obc[1667:10b] Posture 4 tempo: 0.912, syllable: 0, onset: 264.69, ruleTempo: 1.000, uh # Rule 32, duration: 87.13, beat: 338.40
2009-11-03 16:21:11.558 tst_obc[1667:10b] Posture 5 tempo: 0.912, syllable: 0, onset: 338.40, ruleTempo: 1.000, uu # Rule 42, duration: 96.18, beat: 433.24 2009-11-03 16:21:11.559 tst_obc[1667:10b] Foot 2 tempo: 0.942, marked: 0, last: 0, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.559 tst_obc[1667:10b] Posture 6 tempo: 0.942, syllable: 0, onset: 433.24, ruleTempo: 1.000, w # Rule 42, duration: 108.69, beat: 531.74 2009-11-03 16:21:11.560 tst_obc[1667:10b] Posture 7 tempo: 0.942, syllable: 0, onset: 531.74, ruleTempo: 1.000, er # Rule 42, duration: 104.97, beat: 638.99
2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 8 tempo: 0.942, syllable: 0, onset: 638.99, ruleTempo: 1.000, r # Rule 42, duration: 86.40, beat: 739.28 2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 9 tempo: 0.942, syllable: 0, onset: 739.28, ruleTempo: 1.000, ll # Rule 47, duration: 89.26, beat: 846.21
2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 10 tempo: 0.942, syllable: 0, onset: 846.21, ruleTempo: 1.000, d # Rule 21, duration: 170.45, beat: 1016.62 2009-11-03 16:21:11.562 tst_obc[1667:10b] Foot 3 tempo: 0.991, marked: 1, last: 1, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.562 tst_obc[1667:10b] Posture 11 tempo: 0.991, syllable: 0, onset: 1016.62, ruleTempo: 1.000, t' 2009-11-03 16:21:11.563 tst_obc[1667:10b] Posture 12 tempo: 0.991, syllable: 0, onset: 1016.62, ruleTempo: 1.000, e' # Rule 24, duration: 227.35, beat: 1207.80
2009-11-03 16:21:11.563 tst_obc[1667:10b] Posture 13 tempo: 0.991, syllable: 0, onset: 1207.80, ruleTempo: 1.000, k' 2009-11-03 16:21:11.564 tst_obc[1667:10b] Posture 14 tempo: 0.991, syllable: 0, onset: 1207.80, ruleTempo: 1.000, s' # Rule 18, duration: 202.33, beat: 1445.70
2009-11-03 16:21:11.564 tst_obc[1667:10b] Posture 15 tempo: 0.991, syllable: 0, onset: 1445.70, ruleTempo: 1.000, t' 2009-11-03 16:21:11.565 tst_obc[1667:10b] Posture 16 tempo: 0.991, syllable: 0, onset: 1445.70, ruleTempo: 1.000, #'
2009-11-03 16:21:11.565 tst_obc[1667:10b] < printDataStructures: 2009-11-03 16:21:11.566 tst_obc[1667:10b] sample rate: 22050.000000 2009-11-03 16:21:11.567 tst_obc[1667:10b] format id: 6c70636d (lpcm) 2009-11-03 16:21:11.567 tst_obc[1667:10b] format flags: 1c
2009-11-03 16:21:11.568 tst_obc[1667:10b] bytes per packet: 2 2009-11-03 16:21:11.569 tst_obc[1667:10b] frames per packet: 1 2009-11-03 16:21:11.569 tst_obc[1667:10b] bytes per frame: 2 2009-11-03 16:21:11.570 tst_obc[1667:10b] channels per frame: 1
2009-11-03 16:21:11.570 tst_obc[1667:10b] bits per channel: 16 initializeConversion(), sampleRateConverter.maximumSampleValue: 0 2009-11-03 16:21:11.645 tst_obc[1667:10b] number of samples: 32928 2009-11-03 16:21:11.646 tst_obc[1667:10b] maximum sample value: 0.0011
2009-11-03 16:21:11.646 tst_obc[1667:10b] scale: 29971423.7842 2009-11-03 16:21:11.652 tst_obc[1667:10b] dealloc, free(inputData) Segmentation fault logout
Thank you, John Delaney
On Tue, Nov 3, 2009 at 4:13 PM, Marcelo Yassunori Matuda <marcelo.matuda@...> wrote:
Hi John,
On Tue, Nov 3, 2009 at 9:38 PM, John Delaney < a.pet.rock@...> wrote:
> Because I am a bit of a beginner, please let me know if because of my level
> of experience this email is inappropriate.
I am a newbie in ObjC too.
> Here is my very naive implementation:
> #import "TextToSpeech.h"
>
> int main (int argc, const char * argv[]) {
> NSAutoreleasePool * pool = [[NSAutoreleasePool alloc] init];
>
> // insert code here...
> NSLog(@"Hello, World!");
>
> TextToSpeech * textToSpeech;
> NSLog(@"0");
> [textToSpeech init];
>
> NSString * stext = @"hello world text";
>
>
> [textToSpeech speakText:stext];
>
>
> [textToSpeech dealloc];
> //[phoneString dealloc];
>
> [pool drain];
> return 0;
> }
You need to allocate the memory first:
TextToSpeech *textToSpeech = [[TextToSpeech alloc] init];
And AFAIK you should use:
[textToSpeech release];
not dealloc.
Regards,
Marcelo
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
|

|
Re: GNUSpeech Console Utility
Hi John,
If you want to use the GnuSpeech server (the recommended way), then you need to setup a basic distributed objects connection with the server. This can be done in just a couple lines of code. There is an example in the applications directory called GnuTTSClient that does exactly this -- it is intended to be a bare-bones application to show how to connect and send messages to the server and have text synthesized. (Your main class will need to conform to the GnuSpeechServerProtocol which can be found in the GnuSpeech Server project directory -- just drop this header into your project directory -- but this should become apparent after examining the GnuTTSClient source).
If you don't want to use the GnuSpeech server then I assume you copied over from the GnuSpeech server project directory the classes and headers, PhoneToSpeech, and TextToSpeech and embedded them in your project. After making the modifications that Marcelo suggested, if this still doesn't work for you, then check some of your build settings (for example garbage collection should not be set to 'required'), and maybe send some additional diagnostic messages from the Console, including the backtrace from the segmentation fault.
Best, Dalmazio
[...]
Thank you Marcelo, Wow the screen output is really cool! But I'm still not getting sound, and still getting a seg fault at the end. Is there a sound setting I'm missing?
Here is the output at the end:
2009-11-03 16:21:11.555 tst_obc[1667:10b] Posture 0 tempo: 0.912, syllable: 0, onset: 0.00, ruleTempo: 1.000, # # Rule 30, duration: 163.63, beat: 113.54 2009-11-03 16:21:11.556 tst_obc[1667:10b] Posture 1 tempo: 0.912, syllable: 0, onset: 113.54, ruleTempo: 1.000, h
2009-11-03 16:21:11.556 tst_obc[1667:10b] Posture 2 tempo: 0.912, syllable: 1, onset: 113.54, ruleTempo: 1.000, e # Rule 42, duration: 86.69, beat: 180.34 2009-11-03 16:21:11.557 tst_obc[1667:10b] Foot 1 tempo: 0.912, marked: 0, last: 0, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.557 tst_obc[1667:10b] Posture 3 tempo: 0.912, syllable: 0, onset: 180.34, ruleTempo: 1.000, l # Rule 42, duration: 78.47, beat: 264.69 2009-11-03 16:21:11.558 tst_obc[1667:10b] Posture 4 tempo: 0.912, syllable: 0, onset: 264.69, ruleTempo: 1.000, uh # Rule 32, duration: 87.13, beat: 338.40
2009-11-03 16:21:11.558 tst_obc[1667:10b] Posture 5 tempo: 0.912, syllable: 0, onset: 338.40, ruleTempo: 1.000, uu # Rule 42, duration: 96.18, beat: 433.24 2009-11-03 16:21:11.559 tst_obc[1667:10b] Foot 2 tempo: 0.942, marked: 0, last: 0, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.559 tst_obc[1667:10b] Posture 6 tempo: 0.942, syllable: 0, onset: 433.24, ruleTempo: 1.000, w # Rule 42, duration: 108.69, beat: 531.74 2009-11-03 16:21:11.560 tst_obc[1667:10b] Posture 7 tempo: 0.942, syllable: 0, onset: 531.74, ruleTempo: 1.000, er # Rule 42, duration: 104.97, beat: 638.99
2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 8 tempo: 0.942, syllable: 0, onset: 638.99, ruleTempo: 1.000, r # Rule 42, duration: 86.40, beat: 739.28 2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 9 tempo: 0.942, syllable: 0, onset: 739.28, ruleTempo: 1.000, ll # Rule 47, duration: 89.26, beat: 846.21
2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 10 tempo: 0.942, syllable: 0, onset: 846.21, ruleTempo: 1.000, d # Rule 21, duration: 170.45, beat: 1016.62 2009-11-03 16:21:11.562 tst_obc[1667:10b] Foot 3 tempo: 0.991, marked: 1, last: 1, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.562 tst_obc[1667:10b] Posture 11 tempo: 0.991, syllable: 0, onset: 1016.62, ruleTempo: 1.000, t' 2009-11-03 16:21:11.563 tst_obc[1667:10b] Posture 12 tempo: 0.991, syllable: 0, onset: 1016.62, ruleTempo: 1.000, e' # Rule 24, duration: 227.35, beat: 1207.80
2009-11-03 16:21:11.563 tst_obc[1667:10b] Posture 13 tempo: 0.991, syllable: 0, onset: 1207.80, ruleTempo: 1.000, k' 2009-11-03 16:21:11.564 tst_obc[1667:10b] Posture 14 tempo: 0.991, syllable: 0, onset: 1207.80, ruleTempo: 1.000, s' # Rule 18, duration: 202.33, beat: 1445.70
2009-11-03 16:21:11.564 tst_obc[1667:10b] Posture 15 tempo: 0.991, syllable: 0, onset: 1445.70, ruleTempo: 1.000, t' 2009-11-03 16:21:11.565 tst_obc[1667:10b] Posture 16 tempo: 0.991, syllable: 0, onset: 1445.70, ruleTempo: 1.000, #'
2009-11-03 16:21:11.565 tst_obc[1667:10b] < printDataStructures: 2009-11-03 16:21:11.566 tst_obc[1667:10b] sample rate: 22050.000000 2009-11-03 16:21:11.567 tst_obc[1667:10b] format id: 6c70636d (lpcm) 2009-11-03 16:21:11.567 tst_obc[1667:10b] format flags: 1c
2009-11-03 16:21:11.568 tst_obc[1667:10b] bytes per packet: 2 2009-11-03 16:21:11.569 tst_obc[1667:10b] frames per packet: 1 2009-11-03 16:21:11.569 tst_obc[1667:10b] bytes per frame: 2 2009-11-03 16:21:11.570 tst_obc[1667:10b] channels per frame: 1
2009-11-03 16:21:11.570 tst_obc[1667:10b] bits per channel: 16 initializeConversion(), sampleRateConverter.maximumSampleValue: 0 2009-11-03 16:21:11.645 tst_obc[1667:10b] number of samples: 32928 2009-11-03 16:21:11.646 tst_obc[1667:10b] maximum sample value: 0.0011
2009-11-03 16:21:11.646 tst_obc[1667:10b] scale: 29971423.7842 2009-11-03 16:21:11.652 tst_obc[1667:10b] dealloc, free(inputData) Segmentation fault logout
Thank you, John Delaney
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
|

|
Re: GNUSpeech Console Utility
I recompiled the GNUSpeech Framework so that it referenced my local user library. Garbage collection is off, Instruction scheduling is set to off, compiling to OSX 10.5, using GCC 4.2. I get these errors on the console, I assume before the seg fault. I tried searching for them on my hard drive and they are nowhere to be found. Is this because I am not using the most recent version of Foundation?
Attaching to program: `/Users/johndelaney/Code/tst_obc/build/Debug/tst_obc', process 3303. warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/ah_private.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/../../../frmks/auverse/src/libah/ah_private.c".
warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/ah_cl.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/../../../frmks/auverse/src/libah/ah_cl.c".
warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/ah_serv.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/../../../frmks/auverse/src/libah/ah_serv.c".
warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/main.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/src/main.c".
warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/rbuffer.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/../../../frmks/auverse/src/rbuffer/rbuffer.c".
warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/megamix.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/src/megamix.c".
warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/resampler.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/../../../frmks/auverse/src/resampler/resampler.c".
warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/coreaudio.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/src/coreaudio.c".
warning: .o file "/Developer/usr/lib/gcc/i686-apple-darwin9/4.0.1/libgcc.a(_eprintf.o)" more recent than executable timestamp [Switching to process 3303 thread 0x6e03] On Tue, Nov 3, 2009 at 8:49 PM, Dalmazio Brisinda <dbrisinda@...> wrote:
Hi John,
If you want to use the GnuSpeech server (the recommended way), then you need to setup a basic distributed objects connection with the server. This can be done in just a couple lines of code. There is an example in the applications directory called GnuTTSClient that does exactly this -- it is intended to be a bare-bones application to show how to connect and send messages to the server and have text synthesized. (Your main class will need to conform to the GnuSpeechServerProtocol which can be found in the GnuSpeech Server project directory -- just drop this header into your project directory -- but this should become apparent after examining the GnuTTSClient source).
If you don't want to use the GnuSpeech server then I assume you copied over from the GnuSpeech server project directory the classes and headers, PhoneToSpeech, and TextToSpeech and embedded them in your project. After making the modifications that Marcelo suggested, if this still doesn't work for you, then check some of your build settings (for example garbage collection should not be set to 'required'), and maybe send some additional diagnostic messages from the Console, including the backtrace from the segmentation fault.
Best, Dalmazio
[...]
Thank you Marcelo,
Wow the screen output is really cool! But I'm still not getting sound, and still getting a seg fault at the end. Is there a sound setting I'm missing?
Here is the output at the end:
2009-11-03 16:21:11.555 tst_obc[1667:10b] Posture 0 tempo: 0.912, syllable: 0, onset: 0.00, ruleTempo: 1.000, # # Rule 30, duration: 163.63, beat: 113.54 2009-11-03 16:21:11.556 tst_obc[1667:10b] Posture 1 tempo: 0.912, syllable: 0, onset: 113.54, ruleTempo: 1.000, h
2009-11-03 16:21:11.556 tst_obc[1667:10b] Posture 2 tempo: 0.912, syllable: 1, onset: 113.54, ruleTempo: 1.000, e # Rule 42, duration: 86.69, beat: 180.34 2009-11-03 16:21:11.557 tst_obc[1667:10b] Foot 1 tempo: 0.912, marked: 0, last: 0, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.557 tst_obc[1667:10b] Posture 3 tempo: 0.912, syllable: 0, onset: 180.34, ruleTempo: 1.000, l # Rule 42, duration: 78.47, beat: 264.69 2009-11-03 16:21:11.558 tst_obc[1667:10b] Posture 4 tempo: 0.912, syllable: 0, onset: 264.69, ruleTempo: 1.000, uh # Rule 32, duration: 87.13, beat: 338.40
2009-11-03 16:21:11.558 tst_obc[1667:10b] Posture 5 tempo: 0.912, syllable: 0, onset: 338.40, ruleTempo: 1.000, uu # Rule 42, duration: 96.18, beat: 433.24 2009-11-03 16:21:11.559 tst_obc[1667:10b] Foot 2 tempo: 0.942, marked: 0, last: 0, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.559 tst_obc[1667:10b] Posture 6 tempo: 0.942, syllable: 0, onset: 433.24, ruleTempo: 1.000, w # Rule 42, duration: 108.69, beat: 531.74 2009-11-03 16:21:11.560 tst_obc[1667:10b] Posture 7 tempo: 0.942, syllable: 0, onset: 531.74, ruleTempo: 1.000, er # Rule 42, duration: 104.97, beat: 638.99
2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 8 tempo: 0.942, syllable: 0, onset: 638.99, ruleTempo: 1.000, r # Rule 42, duration: 86.40, beat: 739.28 2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 9 tempo: 0.942, syllable: 0, onset: 739.28, ruleTempo: 1.000, ll # Rule 47, duration: 89.26, beat: 846.21
2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 10 tempo: 0.942, syllable: 0, onset: 846.21, ruleTempo: 1.000, d # Rule 21, duration: 170.45, beat: 1016.62 2009-11-03 16:21:11.562 tst_obc[1667:10b] Foot 3 tempo: 0.991, marked: 1, last: 1, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.562 tst_obc[1667:10b] Posture 11 tempo: 0.991, syllable: 0, onset: 1016.62, ruleTempo: 1.000, t' 2009-11-03 16:21:11.563 tst_obc[1667:10b] Posture 12 tempo: 0.991, syllable: 0, onset: 1016.62, ruleTempo: 1.000, e' # Rule 24, duration: 227.35, beat: 1207.80
2009-11-03 16:21:11.563 tst_obc[1667:10b] Posture 13 tempo: 0.991, syllable: 0, onset: 1207.80, ruleTempo: 1.000, k' 2009-11-03 16:21:11.564 tst_obc[1667:10b] Posture 14 tempo: 0.991, syllable: 0, onset: 1207.80, ruleTempo: 1.000, s' # Rule 18, duration: 202.33, beat: 1445.70
2009-11-03 16:21:11.564 tst_obc[1667:10b] Posture 15 tempo: 0.991, syllable: 0, onset: 1445.70, ruleTempo: 1.000, t' 2009-11-03 16:21:11.565 tst_obc[1667:10b] Posture 16 tempo: 0.991, syllable: 0, onset: 1445.70, ruleTempo: 1.000, #'
2009-11-03 16:21:11.565 tst_obc[1667:10b] < printDataStructures: 2009-11-03 16:21:11.566 tst_obc[1667:10b] sample rate: 22050.000000 2009-11-03 16:21:11.567 tst_obc[1667:10b] format id: 6c70636d (lpcm) 2009-11-03 16:21:11.567 tst_obc[1667:10b] format flags: 1c
2009-11-03 16:21:11.568 tst_obc[1667:10b] bytes per packet: 2 2009-11-03 16:21:11.569 tst_obc[1667:10b] frames per packet: 1 2009-11-03 16:21:11.569 tst_obc[1667:10b] bytes per frame: 2 2009-11-03 16:21:11.570 tst_obc[1667:10b] channels per frame: 1
2009-11-03 16:21:11.570 tst_obc[1667:10b] bits per channel: 16 initializeConversion(), sampleRateConverter.maximumSampleValue: 0 2009-11-03 16:21:11.645 tst_obc[1667:10b] number of samples: 32928 2009-11-03 16:21:11.646 tst_obc[1667:10b] maximum sample value: 0.0011
2009-11-03 16:21:11.646 tst_obc[1667:10b] scale: 29971423.7842 2009-11-03 16:21:11.652 tst_obc[1667:10b] dealloc, free(inputData) Segmentation fault logout
Thank you, John Delaney
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
|

|
Re: GNUSpeech Console Utility
John,
Could you try installing the latest pre-compiled beta-release version and see if the various applications generate speech? Then we can hopefully rule out any sort of system configuration issue -- at least as far as the executable apps and libs are concerned.
Also, to get the backtrace of the segmentation fault, open Console, and show the Log List. Then scroll down and you should see under the headings DIAGNOSTIC INFORMATION -> User Diagnostic Information -> <the crash file> or FILES -> ~/Library/Logs -> CrashReporter -> <the crash file>
Best, Dalmazio
Send gnuspeech-contact mailing list submissions to gnuspeech-contact@...
To subscribe or unsubscribe via the World Wide Web, visit http://lists.gnu.org/mailman/listinfo/gnuspeech-contact or, via email, send a message with subject or body 'help' to gnuspeech-contact-request@...
You can reach the person managing the list at gnuspeech-contact-owner@...
When replying, please edit your Subject line so it is more specific than "Re: Contents of gnuspeech-contact digest..." Today's Topics:
1. Re: GNUSpeech Console Utility (Dalmazio Brisinda) 2. Re: GNUSpeech Console Utility (John Delaney)
Date: November 3, 2009 9:49:51 PM MST
Subject: Re: [gnuspeech-contact] GNUSpeech Console Utility
Hi John,
If you want to use the GnuSpeech server (the recommended way), then you need to setup a basic distributed objects connection with the server. This can be done in just a couple lines of code. There is an example in the applications directory called GnuTTSClient that does exactly this -- it is intended to be a bare-bones application to show how to connect and send messages to the server and have text synthesized. (Your main class will need to conform to the GnuSpeechServerProtocol which can be found in the GnuSpeech Server project directory -- just drop this header into your project directory -- but this should become apparent after examining the GnuTTSClient source).
If you don't want to use the GnuSpeech server then I assume you copied over from the GnuSpeech server project directory the classes and headers, PhoneToSpeech, and TextToSpeech and embedded them in your project. After making the modifications that Marcelo suggested, if this still doesn't work for you, then check some of your build settings (for example garbage collection should not be set to 'required'), and maybe send some additional diagnostic messages from the Console, including the backtrace from the segmentation fault.
Best, Dalmazio
[...]
Thank you Marcelo, Wow the screen output is really cool! But I'm still not getting sound, and still getting a seg fault at the end. Is there a sound setting I'm missing?
Here is the output at the end:
2009-11-03 16:21:11.555 tst_obc[1667:10b] Posture 0 tempo: 0.912, syllable: 0, onset: 0.00, ruleTempo: 1.000, # # Rule 30, duration: 163.63, beat: 113.54 2009-11-03 16:21:11.556 tst_obc[1667:10b] Posture 1 tempo: 0.912, syllable: 0, onset: 113.54, ruleTempo: 1.000, h
2009-11-03 16:21:11.556 tst_obc[1667:10b] Posture 2 tempo: 0.912, syllable: 1, onset: 113.54, ruleTempo: 1.000, e # Rule 42, duration: 86.69, beat: 180.34 2009-11-03 16:21:11.557 tst_obc[1667:10b] Foot 1 tempo: 0.912, marked: 0, last: 0, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.557 tst_obc[1667:10b] Posture 3 tempo: 0.912, syllable: 0, onset: 180.34, ruleTempo: 1.000, l # Rule 42, duration: 78.47, beat: 264.69 2009-11-03 16:21:11.558 tst_obc[1667:10b] Posture 4 tempo: 0.912, syllable: 0, onset: 264.69, ruleTempo: 1.000, uh # Rule 32, duration: 87.13, beat: 338.40
2009-11-03 16:21:11.558 tst_obc[1667:10b] Posture 5 tempo: 0.912, syllable: 0, onset: 338.40, ruleTempo: 1.000, uu # Rule 42, duration: 96.18, beat: 433.24 2009-11-03 16:21:11.559 tst_obc[1667:10b] Foot 2 tempo: 0.942, marked: 0, last: 0, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.559 tst_obc[1667:10b] Posture 6 tempo: 0.942, syllable: 0, onset: 433.24, ruleTempo: 1.000, w # Rule 42, duration: 108.69, beat: 531.74 2009-11-03 16:21:11.560 tst_obc[1667:10b] Posture 7 tempo: 0.942, syllable: 0, onset: 531.74, ruleTempo: 1.000, er # Rule 42, duration: 104.97, beat: 638.99
2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 8 tempo: 0.942, syllable: 0, onset: 638.99, ruleTempo: 1.000, r # Rule 42, duration: 86.40, beat: 739.28 2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 9 tempo: 0.942, syllable: 0, onset: 739.28, ruleTempo: 1.000, ll # Rule 47, duration: 89.26, beat: 846.21
2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 10 tempo: 0.942, syllable: 0, onset: 846.21, ruleTempo: 1.000, d # Rule 21, duration: 170.45, beat: 1016.62 2009-11-03 16:21:11.562 tst_obc[1667:10b] Foot 3 tempo: 0.991, marked: 1, last: 1, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.562 tst_obc[1667:10b] Posture 11 tempo: 0.991, syllable: 0, onset: 1016.62, ruleTempo: 1.000, t' 2009-11-03 16:21:11.563 tst_obc[1667:10b] Posture 12 tempo: 0.991, syllable: 0, onset: 1016.62, ruleTempo: 1.000, e' # Rule 24, duration: 227.35, beat: 1207.80
2009-11-03 16:21:11.563 tst_obc[1667:10b] Posture 13 tempo: 0.991, syllable: 0, onset: 1207.80, ruleTempo: 1.000, k' 2009-11-03 16:21:11.564 tst_obc[1667:10b] Posture 14 tempo: 0.991, syllable: 0, onset: 1207.80, ruleTempo: 1.000, s' # Rule 18, duration: 202.33, beat: 1445.70
2009-11-03 16:21:11.564 tst_obc[1667:10b] Posture 15 tempo: 0.991, syllable: 0, onset: 1445.70, ruleTempo: 1.000, t' 2009-11-03 16:21:11.565 tst_obc[1667:10b] Posture 16 tempo: 0.991, syllable: 0, onset: 1445.70, ruleTempo: 1.000, #'
2009-11-03 16:21:11.565 tst_obc[1667:10b] < printDataStructures: 2009-11-03 16:21:11.566 tst_obc[1667:10b] sample rate: 22050.000000 2009-11-03 16:21:11.567 tst_obc[1667:10b] format id: 6c70636d (lpcm) 2009-11-03 16:21:11.567 tst_obc[1667:10b] format flags: 1c
2009-11-03 16:21:11.568 tst_obc[1667:10b] bytes per packet: 2 2009-11-03 16:21:11.569 tst_obc[1667:10b] frames per packet: 1 2009-11-03 16:21:11.569 tst_obc[1667:10b] bytes per frame: 2 2009-11-03 16:21:11.570 tst_obc[1667:10b] channels per frame: 1
2009-11-03 16:21:11.570 tst_obc[1667:10b] bits per channel: 16 initializeConversion(), sampleRateConverter.maximumSampleValue: 0 2009-11-03 16:21:11.645 tst_obc[1667:10b] number of samples: 32928 2009-11-03 16:21:11.646 tst_obc[1667:10b] maximum sample value: 0.0011
2009-11-03 16:21:11.646 tst_obc[1667:10b] scale: 29971423.7842 2009-11-03 16:21:11.652 tst_obc[1667:10b] dealloc, free(inputData) Segmentation fault logout
Thank you, John Delaney
Date: November 4, 2009 12:08:50 AM MST
Subject: Re: [gnuspeech-contact] GNUSpeech Console Utility
I recompiled the GNUSpeech Framework so that it referenced my local user library. Garbage collection is off, Instruction scheduling is set to off, compiling to OSX 10.5, using GCC 4.2. I get these errors on the console, I assume before the seg fault. I tried searching for them on my hard drive and they are nowhere to be found. Is this because I am not using the most recent version of Foundation?
Attaching to program: `/Users/johndelaney/Code/tst_obc/build/Debug/tst_obc', process 3303. warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/ah_private.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/../../../frmks/auverse/src/libah/ah_private.c".
warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/ah_cl.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/../../../frmks/auverse/src/libah/ah_cl.c".
warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/ah_serv.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/../../../frmks/auverse/src/libah/ah_serv.c".
warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/main.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/src/main.c".
warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/rbuffer.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/../../../frmks/auverse/src/rbuffer/rbuffer.c".
warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/megamix.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/src/megamix.c".
warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/resampler.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/../../../frmks/auverse/src/resampler/resampler.c".
warning: Could not find object file "/Users/qdc/Projects/main/tools/hijack/ahs/build/ahs.build/Release FAT/ah_hermes.build/Objects-normal/i386/coreaudio.o" - no debug information available for "/Users/qdc/Projects/main/tools/hijack/ahs/src/coreaudio.c".
warning: .o file "/Developer/usr/lib/gcc/i686-apple-darwin9/4.0.1/libgcc.a(_eprintf.o)" more recent than executable timestamp [Switching to process 3303 thread 0x6e03]
On Tue, Nov 3, 2009 at 8:49 PM, Dalmazio Brisinda <dbrisinda@...> wrote:
Hi John,
If you want to use the GnuSpeech server (the recommended way), then you need to setup a basic distributed objects connection with the server. This can be done in just a couple lines of code. There is an example in the applications directory called GnuTTSClient that does exactly this -- it is intended to be a bare-bones application to show how to connect and send messages to the server and have text synthesized. (Your main class will need to conform to the GnuSpeechServerProtocol which can be found in the GnuSpeech Server project directory -- just drop this header into your project directory -- but this should become apparent after examining the GnuTTSClient source).
If you don't want to use the GnuSpeech server then I assume you copied over from the GnuSpeech server project directory the classes and headers, PhoneToSpeech, and TextToSpeech and embedded them in your project. After making the modifications that Marcelo suggested, if this still doesn't work for you, then check some of your build settings (for example garbage collection should not be set to 'required'), and maybe send some additional diagnostic messages from the Console, including the backtrace from the segmentation fault.
Best, Dalmazio
[...]
Thank you Marcelo,
Wow the screen output is really cool! But I'm still not getting sound, and still getting a seg fault at the end. Is there a sound setting I'm missing?
Here is the output at the end:
2009-11-03 16:21:11.555 tst_obc[1667:10b] Posture 0 tempo: 0.912, syllable: 0, onset: 0.00, ruleTempo: 1.000, # # Rule 30, duration: 163.63, beat: 113.54 2009-11-03 16:21:11.556 tst_obc[1667:10b] Posture 1 tempo: 0.912, syllable: 0, onset: 113.54, ruleTempo: 1.000, h
2009-11-03 16:21:11.556 tst_obc[1667:10b] Posture 2 tempo: 0.912, syllable: 1, onset: 113.54, ruleTempo: 1.000, e # Rule 42, duration: 86.69, beat: 180.34 2009-11-03 16:21:11.557 tst_obc[1667:10b] Foot 1 tempo: 0.912, marked: 0, last: 0, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.557 tst_obc[1667:10b] Posture 3 tempo: 0.912, syllable: 0, onset: 180.34, ruleTempo: 1.000, l # Rule 42, duration: 78.47, beat: 264.69 2009-11-03 16:21:11.558 tst_obc[1667:10b] Posture 4 tempo: 0.912, syllable: 0, onset: 264.69, ruleTempo: 1.000, uh # Rule 32, duration: 87.13, beat: 338.40
2009-11-03 16:21:11.558 tst_obc[1667:10b] Posture 5 tempo: 0.912, syllable: 0, onset: 338.40, ruleTempo: 1.000, uu # Rule 42, duration: 96.18, beat: 433.24 2009-11-03 16:21:11.559 tst_obc[1667:10b] Foot 2 tempo: 0.942, marked: 0, last: 0, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.559 tst_obc[1667:10b] Posture 6 tempo: 0.942, syllable: 0, onset: 433.24, ruleTempo: 1.000, w # Rule 42, duration: 108.69, beat: 531.74 2009-11-03 16:21:11.560 tst_obc[1667:10b] Posture 7 tempo: 0.942, syllable: 0, onset: 531.74, ruleTempo: 1.000, er # Rule 42, duration: 104.97, beat: 638.99
2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 8 tempo: 0.942, syllable: 0, onset: 638.99, ruleTempo: 1.000, r # Rule 42, duration: 86.40, beat: 739.28 2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 9 tempo: 0.942, syllable: 0, onset: 739.28, ruleTempo: 1.000, ll # Rule 47, duration: 89.26, beat: 846.21
2009-11-03 16:21:11.561 tst_obc[1667:10b] Posture 10 tempo: 0.942, syllable: 0, onset: 846.21, ruleTempo: 1.000, d # Rule 21, duration: 170.45, beat: 1016.62 2009-11-03 16:21:11.562 tst_obc[1667:10b] Foot 3 tempo: 0.991, marked: 1, last: 1, onset1: 0.000, onset2: 0.000
2009-11-03 16:21:11.562 tst_obc[1667:10b] Posture 11 tempo: 0.991, syllable: 0, onset: 1016.62, ruleTempo: 1.000, t' 2009-11-03 16:21:11.563 tst_obc[1667:10b] Posture 12 tempo: 0.991, syllable: 0, onset: 1016.62, ruleTempo: 1.000, e' # Rule 24, duration: 227.35, beat: 1207.80
2009-11-03 16:21:11.563 tst_obc[1667:10b] Posture 13 tempo: 0.991, syllable: 0, onset: 1207.80, ruleTempo: 1.000, k' 2009-11-03 16:21:11.564 tst_obc[1667:10b] Posture 14 tempo: 0.991, syllable: 0, onset: 1207.80, ruleTempo: 1.000, s' # Rule 18, duration: 202.33, beat: 1445.70
2009-11-03 16:21:11.564 tst_obc[1667:10b] Posture 15 tempo: 0.991, syllable: 0, onset: 1445.70, ruleTempo: 1.000, t' 2009-11-03 16:21:11.565 tst_obc[1667:10b] Posture 16 tempo: 0.991, syllable: 0, onset: 1445.70, ruleTempo: 1.000, #'
2009-11-03 16:21:11.565 tst_obc[1667:10b] < printDataStructures: 2009-11-03 16:21:11.566 tst_obc[1667:10b] sample rate: 22050.000000 2009-11-03 16:21:11.567 tst_obc[1667:10b] format id: 6c70636d (lpcm) 2009-11-03 16:21:11.567 tst_obc[1667:10b] format flags: 1c
2009-11-03 16:21:11.568 tst_obc[1667:10b] bytes per packet: 2 2009-11-03 16:21:11.569 tst_obc[1667:10b] frames per packet: 1 2009-11-03 16:21:11.569 tst_obc[1667:10b] bytes per frame: 2 2009-11-03 16:21:11.570 tst_obc[1667:10b] channels per frame: 1
2009-11-03 16:21:11.570 tst_obc[1667:10b] bits per channel: 16 initializeConversion(), sampleRateConverter.maximumSampleValue: 0 2009-11-03 16:21:11.645 tst_obc[1667:10b] number of samples: 32928 2009-11-03 16:21:11.646 tst_obc[1667:10b] maximum sample value: 0.0011
2009-11-03 16:21:11.646 tst_obc[1667:10b] scale: 29971423.7842 2009-11-03 16:21:11.652 tst_obc[1667:10b] dealloc, free(inputData) Segmentation fault logout
Thank you, John Delaney
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
_______________________________________________ gnuspeech-contact mailing list gnuspeech-contact@... http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
|

|
Re: GNUSpeech Console Utility
Have a look at Linked Frameworks section in the Xcode Groups & Files pane. I've found in the past that for setting up the project on a different system, I've often had to remove the custom frameworks (Tube and GnuSpeech) and then add them again, so Xcode correctly picks up the new locations -- unless they're in standard system Framework folders. If you would like additional information on Xcode, have a look at the book "Xcode Unleashed" -- there may be others.
Also, you might just want to confirm that when you open up the build options panel (CMD-i), select the Build tab, and some ways down you will see in the "Search Paths" section the option: Framework Paths, specifically, double-click and see that it's set to "$(SRCROOT)/.." with the "recursive" switch turned on. This is set for all the individual xcode projects. This expands out to show you the actual additional paths used to search and link against. $(SRCROOT) is configured on my system to point to "../.." in the General tab of the project info panel (CMD-i). Alternatively, you can just use hard paths in specifying the header/framework paths.
If you have the frameworks installed, then you might want to go to where they are (/Library/Frameworks/GnuSpeech.framework and /Library/Frameworks/Tube.framework) and rename them (say with .original appended), and then create links in /Library/Frameworks/ to the compiled frameworks in your project directory. You only really need to do this if you think you will be modifying or experimenting with the Tube and GnuSpeech frameworks. That way apps pick up the latest changes to these frameworks, instead of using the previously installed frameworks.
Also, if you have problems, try opening up the Tube and GnuSpeech xcode projects, and build them separately. Then go back and try and build the various apps etc. that depend on them. This in the event of some odd Xcode project configuration settings.
If you've made any other changes to the build settings, I suggest downloading another copy of the distribution as a backup "reference-only" copy so you can compare against it to see if your build settings for any given project match. Accidental or experimental changes to some of these settings may result in warnings or even errors during compilation/linking.
Best, Dalmazio
On 2009-11-04, at 12:06 PM, John Delaney wrote: Ok, found the diagnostic info. I have the most recent build of GnuSpeech, and the applications do produce speech.
Also I have a question, I notice that in Xcode, rather than having the compiled frameworks, you have an xcode project of the framework. When I try to do that I get linking errors. How do you do include the uncompiled frameworks and then get them to compile? I've looked through all the settings and can't figure out the problem. Pardon my rookie questions.
On Wed, Nov 4, 2009 at 12:17 AM, Dalmazio Brisinda <dbrisinda@...> wrote:
John,
Could you try installing the latest pre-compiled beta-release version and see if the various applications generate speech? Then we can hopefully rule out any sort of system configuration issue -- at least as far as the executable apps and libs are concerned.
Also, to get the backtrace of the segmentation fault, open Console, and show the Log List. Then scroll down and you should see under the headings DIAGNOSTIC INFORMATION -> User Diagnostic Information -> <the crash file> or FILES -> ~/Library/Logs -> CrashReporter -> <the crash file>
Best, Dalmazio _______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
|

|
Re: GNUSpeech Console Utility
Thank you all for your help. I have switched to using the server method because its very easy and functional. Am I mistaken, though that many of the parameters such as pitch and intonation have not yet been implemented to the server? I am looking at the server and all the get/set methods have return zero. I suppose I will need to impliment those if this is the case.
On Wed, Nov 4, 2009 at 12:37 PM, Dalmazio Brisinda <dbrisinda@...> wrote:
Have a look at Linked Frameworks section in the Xcode Groups & Files pane. I've found in the past that for setting up the project on a different system, I've often had to remove the custom frameworks (Tube and GnuSpeech) and then add them again, so Xcode correctly picks up the new locations -- unless they're in standard system Framework folders. If you would like additional information on Xcode, have a look at the book "Xcode Unleashed" -- there may be others.
Also, you might just want to confirm that when you open up the build options panel (CMD-i), select the Build tab, and some ways down you will see in the "Search Paths" section the option: Framework Paths, specifically, double-click and see that it's set to "$(SRCROOT)/.." with the "recursive" switch turned on. This is set for all the individual xcode projects. This expands out to show you the actual additional paths used to search and link against. $(SRCROOT) is configured on my system to point to "../.." in the General tab of the project info panel (CMD-i). Alternatively, you can just use hard paths in specifying the header/framework paths.
If you have the frameworks installed, then you might want to go to where they are (/Library/Frameworks/GnuSpeech.framework and /Library/Frameworks/Tube.framework) and rename them (say with .original appended), and then create links in /Library/Frameworks/ to the compiled frameworks in your project directory. You only really need to do this if you think you will be modifying or experimenting with the Tube and GnuSpeech frameworks. That way apps pick up the latest changes to these frameworks, instead of using the previously installed frameworks.
Also, if you have problems, try opening up the Tube and GnuSpeech xcode projects, and build them separately. Then go back and try and build the various apps etc. that depend on them. This in the event of some odd Xcode project configuration settings.
If you've made any other changes to the build settings, I suggest downloading another copy of the distribution as a backup "reference-only" copy so you can compare against it to see if your build settings for any given project match. Accidental or experimental changes to some of these settings may result in warnings or even errors during compilation/linking.
Best, Dalmazio
On 2009-11-04, at 12:06 PM, John Delaney wrote: Ok, found the diagnostic info. I have the most recent build of GnuSpeech, and the applications do produce speech.
Also I have a question, I notice that in Xcode, rather than having the compiled frameworks, you have an xcode project of the framework. When I try to do that I get linking errors. How do you do include the uncompiled frameworks and then get them to compile? I've looked through all the settings and can't figure out the problem. Pardon my rookie questions.
On Wed, Nov 4, 2009 at 12:17 AM, Dalmazio Brisinda <dbrisinda@...> wrote:
John,
Could you try installing the latest pre-compiled beta-release version and see if the various applications generate speech? Then we can hopefully rule out any sort of system configuration issue -- at least as far as the executable apps and libs are concerned.
Also, to get the backtrace of the segmentation fault, open Console, and show the Log List. Then scroll down and you should see under the headings DIAGNOSTIC INFORMATION -> User Diagnostic Information -> <the crash file> or FILES -> ~/Library/Logs -> CrashReporter -> <the crash file>
Best, Dalmazio
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
|

|
Re: GNUSpeech Console Utility
Yes, you are correct. All those server methods are yet to be implemented. Currently the server just supports speaking text with the defaults that were taken from Monet. This is certainly one area that could use some filling out, and any contribution would be more than welcome.
Best, Dalmazio
On 2009-11-04, at 5:54 PM, John Delaney wrote: Thank you all for your help. I have switched to using the server method because its very easy and functional. Am I mistaken, though that many of the parameters such as pitch and intonation have not yet been implemented to the server? I am looking at the server and all the get/set methods have return zero. I suppose I will need to impliment those if this is the case.
On Wed, Nov 4, 2009 at 12:37 PM, Dalmazio Brisinda <dbrisinda@...> wrote:
Have a look at Linked Frameworks section in the Xcode Groups & Files pane. I've found in the past that for setting up the project on a different system, I've often had to remove the custom frameworks (Tube and GnuSpeech) and then add them again, so Xcode correctly picks up the new locations -- unless they're in standard system Framework folders. If you would like additional information on Xcode, have a look at the book "Xcode Unleashed" -- there may be others.
Also, you might just want to confirm that when you open up the build options panel (CMD-i), select the Build tab, and some ways down you will see in the "Search Paths" section the option: Framework Paths, specifically, double-click and see that it's set to "$(SRCROOT)/.." with the "recursive" switch turned on. This is set for all the individual xcode projects. This expands out to show you the actual additional paths used to search and link against. $(SRCROOT) is configured on my system to point to "../.." in the General tab of the project info panel (CMD-i). Alternatively, you can just use hard paths in specifying the header/framework paths.
If you have the frameworks installed, then you might want to go to where they are (/Library/Frameworks/GnuSpeech.framework and /Library/Frameworks/Tube.framework) and rename them (say with .original appended), and then create links in /Library/Frameworks/ to the compiled frameworks in your project directory. You only really need to do this if you think you will be modifying or experimenting with the Tube and GnuSpeech frameworks. That way apps pick up the latest changes to these frameworks, instead of using the previously installed frameworks.
Also, if you have problems, try opening up the Tube and GnuSpeech xcode projects, and build them separately. Then go back and try and build the various apps etc. that depend on them. This in the event of some odd Xcode project configuration settings.
If you've made any other changes to the build settings, I suggest downloading another copy of the distribution as a backup "reference-only" copy so you can compare against it to see if your build settings for any given project match. Accidental or experimental changes to some of these settings may result in warnings or even errors during compilation/linking.
Best, Dalmazio
On 2009-11-04, at 12:06 PM, John Delaney wrote: Ok, found the diagnostic info. I have the most recent build of GnuSpeech, and the applications do produce speech.
Also I have a question, I notice that in Xcode, rather than having the compiled frameworks, you have an xcode project of the framework. When I try to do that I get linking errors. How do you do include the uncompiled frameworks and then get them to compile? I've looked through all the settings and can't figure out the problem. Pardon my rookie questions.
On Wed, Nov 4, 2009 at 12:17 AM, Dalmazio Brisinda <dbrisinda@...> wrote:
John,
Could you try installing the latest pre-compiled beta-release version and see if the various applications generate speech? Then we can hopefully rule out any sort of system configuration issue -- at least as far as the executable apps and libs are concerned.
Also, to get the backtrace of the segmentation fault, open Console, and show the Log List. Then scroll down and you should see under the headings DIAGNOSTIC INFORMATION -> User Diagnostic Information -> <the crash file> or FILES -> ~/Library/Logs -> CrashReporter -> <the crash file>
Best, Dalmazio
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
|

|
Re: GNUSpeech Console Utility
Here I was trying to implement a speech synthesis API for a graduate musical synthesis class, and now I'm getting roped into actually working on the project. I'll implement some sort of Parameter class to hold the current intonation parameters, that should be pretty simple.
Would it be possible for the synthesis engine to ramp up the intonation at the end of a sentence whenever there is a question mark? I don't think I have seen a synthesis engine, yet to do this, and it seems like such a small/easy thing to do.
Perhaps I'll revisit this when I eventually take machine learning classes. Thank you, John Delaney On Wed, Nov 4, 2009 at 5:09 PM, Dalmazio Brisinda <dbrisinda@...> wrote:
Yes, you are correct. All those server methods are yet to be implemented. Currently the server just supports speaking text with the defaults that were taken from Monet. This is certainly one area that could use some filling out, and any contribution would be more than welcome.
Best, Dalmazio
On 2009-11-04, at 5:54 PM, John Delaney wrote: Thank you all for your help. I have switched to using the server method because its very easy and functional. Am I mistaken, though that many of the parameters such as pitch and intonation have not yet been implemented to the server? I am looking at the server and all the get/set methods have return zero. I suppose I will need to impliment those if this is the case.
On Wed, Nov 4, 2009 at 12:37 PM, Dalmazio Brisinda <dbrisinda@...> wrote:
Have a look at Linked Frameworks section in the Xcode Groups & Files pane. I've found in the past that for setting up the project on a different system, I've often had to remove the custom frameworks (Tube and GnuSpeech) and then add them again, so Xcode correctly picks up the new locations -- unless they're in standard system Framework folders. If you would like additional information on Xcode, have a look at the book "Xcode Unleashed" -- there may be others.
Also, you might just want to confirm that when you open up the build options panel (CMD-i), select the Build tab, and some ways down you will see in the "Search Paths" section the option: Framework Paths, specifically, double-click and see that it's set to "$(SRCROOT)/.." with the "recursive" switch turned on. This is set for all the individual xcode projects. This expands out to show you the actual additional paths used to search and link against. $(SRCROOT) is configured on my system to point to "../.." in the General tab of the project info panel (CMD-i). Alternatively, you can just use hard paths in specifying the header/framework paths.
If you have the frameworks installed, then you might want to go to where they are (/Library/Frameworks/GnuSpeech.framework and /Library/Frameworks/Tube.framework) and rename them (say with .original appended), and then create links in /Library/Frameworks/ to the compiled frameworks in your project directory. You only really need to do this if you think you will be modifying or experimenting with the Tube and GnuSpeech frameworks. That way apps pick up the latest changes to these frameworks, instead of using the previously installed frameworks.
Also, if you have problems, try opening up the Tube and GnuSpeech xcode projects, and build them separately. Then go back and try and build the various apps etc. that depend on them. This in the event of some odd Xcode project configuration settings.
If you've made any other changes to the build settings, I suggest downloading another copy of the distribution as a backup "reference-only" copy so you can compare against it to see if your build settings for any given project match. Accidental or experimental changes to some of these settings may result in warnings or even errors during compilation/linking.
Best, Dalmazio
On 2009-11-04, at 12:06 PM, John Delaney wrote: Ok, found the diagnostic info. I have the most recent build of GnuSpeech, and the applications do produce speech.
Also I have a question, I notice that in Xcode, rather than having the compiled frameworks, you have an xcode project of the framework. When I try to do that I get linking errors. How do you do include the uncompiled frameworks and then get them to compile? I've looked through all the settings and can't figure out the problem. Pardon my rookie questions.
On Wed, Nov 4, 2009 at 12:17 AM, Dalmazio Brisinda <dbrisinda@...> wrote:
John,
Could you try installing the latest pre-compiled beta-release version and see if the various applications generate speech? Then we can hopefully rule out any sort of system configuration issue -- at least as far as the executable apps and libs are concerned.
Also, to get the backtrace of the segmentation fault, open Console, and show the Log List. Then scroll down and you should see under the headings DIAGNOSTIC INFORMATION -> User Diagnostic Information -> <the crash file> or FILES -> ~/Library/Logs -> CrashReporter -> <the crash file>
Best, Dalmazio
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
|

|
Re: GNUSpeech Console Utility
One thing I've noticed is that the syntax in the speechserverprotocol is different from the notation used by the eventlist and monet, ie: notionalPitch pretonicRange pretonicLift tonicRange
tonicMovement are named differently than: speed; elasticity; intonation; voiceType; pitchOffset; volume; balance; I'm not sure which variables correspond to what, so I'll take a good first stab and try to get things working for my purposes now.
Thank you, John Delaney On Wed, Nov 4, 2009 at 6:21 PM, John Delaney <a.pet.rock@...> wrote:
Here I was trying to implement a speech synthesis API for a graduate musical synthesis class, and now I'm getting roped into actually working on the project. I'll implement some sort of Parameter class to hold the current intonation parameters, that should be pretty simple.
Would it be possible for the synthesis engine to ramp up the intonation at the end of a sentence whenever there is a question mark? I don't think I have seen a synthesis engine, yet to do this, and it seems like such a small/easy thing to do.
Perhaps I'll revisit this when I eventually take machine learning classes.
Thank you, John DelaneyOn Wed, Nov 4, 2009 at 5:09 PM, Dalmazio Brisinda <dbrisinda@...> wrote:
Yes, you are correct. All those server methods are yet to be implemented. Currently the server just supports speaking text with the defaults that were taken from Monet. This is certainly one area that could use some filling out, and any contribution would be more than welcome.
Best, Dalmazio
On 2009-11-04, at 5:54 PM, John Delaney wrote: Thank you all for your help. I have switched to using the server method because its very easy and functional. Am I mistaken, though that many of the parameters such as pitch and intonation have not yet been implemented to the server? I am looking at the server and all the get/set methods have return zero. I suppose I will need to impliment those if this is the case.
On Wed, Nov 4, 2009 at 12:37 PM, Dalmazio Brisinda <dbrisinda@...> wrote:
Have a look at Linked Frameworks section in the Xcode Groups & Files pane. I've found in the past that for setting up the project on a different system, I've often had to remove the custom frameworks (Tube and GnuSpeech) and then add them again, so Xcode correctly picks up the new locations -- unless they're in standard system Framework folders. If you would like additional information on Xcode, have a look at the book "Xcode Unleashed" -- there may be others.
Also, you might just want to confirm that when you open up the build options panel (CMD-i), select the Build tab, and some ways down you will see in the "Search Paths" section the option: Framework Paths, specifically, double-click and see that it's set to "$(SRCROOT)/.." with the "recursive" switch turned on. This is set for all the individual xcode projects. This expands out to show you the actual additional paths used to search and link against. $(SRCROOT) is configured on my system to point to "../.." in the General tab of the project info panel (CMD-i). Alternatively, you can just use hard paths in specifying the header/framework paths.
If you have the frameworks installed, then you might want to go to where they are (/Library/Frameworks/GnuSpeech.framework and /Library/Frameworks/Tube.framework) and rename them (say with .original appended), and then create links in /Library/Frameworks/ to the compiled frameworks in your project directory. You only really need to do this if you think you will be modifying or experimenting with the Tube and GnuSpeech frameworks. That way apps pick up the latest changes to these frameworks, instead of using the previously installed frameworks.
Also, if you have problems, try opening up the Tube and GnuSpeech xcode projects, and build them separately. Then go back and try and build the various apps etc. that depend on them. This in the event of some odd Xcode project configuration settings.
If you've made any other changes to the build settings, I suggest downloading another copy of the distribution as a backup "reference-only" copy so you can compare against it to see if your build settings for any given project match. Accidental or experimental changes to some of these settings may result in warnings or even errors during compilation/linking.
Best, Dalmazio
On 2009-11-04, at 12:06 PM, John Delaney wrote: Ok, found the diagnostic info. I have the most recent build of GnuSpeech, and the applications do produce speech.
Also I have a question, I notice that in Xcode, rather than having the compiled frameworks, you have an xcode project of the framework. When I try to do that I get linking errors. How do you do include the uncompiled frameworks and then get them to compile? I've looked through all the settings and can't figure out the problem. Pardon my rookie questions.
On Wed, Nov 4, 2009 at 12:17 AM, Dalmazio Brisinda <dbrisinda@...> wrote:
John,
Could you try installing the latest pre-compiled beta-release version and see if the various applications generate speech? Then we can hopefully rule out any sort of system configuration issue -- at least as far as the executable apps and libs are concerned.
Also, to get the backtrace of the segmentation fault, open Console, and show the Log List. Then scroll down and you should see under the headings DIAGNOSTIC INFORMATION -> User Diagnostic Information -> <the crash file> or FILES -> ~/Library/Logs -> CrashReporter -> <the crash file>
Best, Dalmazio
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
|

|
Re: GNUSpeech Console Utility
Hi John,
The original text-to-speech system on the NeXT, on which the port is based, did address the "question" intonation pattern.
The intonation patterns are affected by the punctuation and intonation control parameters. But, properly, only questions expecting the answer "Yes" or No", or statements expressing uncertainty that really have rising intonation at the end.
The rampant "up-talk" by the younger generation in Canada is an exception -- everything in "up-talk" gets a rising intonation at the end, perhaps a sign of insecurity in the speaker! :-).
Wh- questions don't show the rising intonation. The system did not make allowance for this distinction -- it would have required some grammatical analysis which we had not tackled, but it should be. It isn't just a matter of detecting the presence of words like "why", "when" "who", what", and "how" because it is fairly easy to frame a "Yes/No" question that also contains one or more of these words (for example: "Did you tell her when we were supposed to meet?").
The system also had regular statements and emphatic statements. There should have been a lot more, and the plan was to implement the whole of Michael Halliday's description of the intonation of British English (he wrote an excellent tutorial book, with accompanying taped examples: A course in spoken English: Intonation" -- Oxford U. Press 1970 SBN [sic] 19 453066 3).
The intonation system was tied to the metrical aspects of English described by a number of British linguists -- most notably Professor David Abercrombie who was at Edinburgh university. We carried out significant research at the U of Calgary on the rhythm and intonation of British English and this was used when we spun off Trillium Sound Research and built the original NeXT system. The rhythm and intonation were regarded as significantly effective features of the text-to-speech system, even though the research results and Halliday were only partially implemented. The speech was found to be much less tiring to listen to for long periods than, for example, DECTalk (which was based on MITalk developed at MIT: "From text to speech: the MITalk system," Allen, Hunnicutt & Klatt, Cambridge University Press, 1987 ISBN 0-521-30641-8)
Abercrombie's claim was that spoken British English had "a tendency towards isochrony". Specifically, spoken phrases and sentences could be split into "feet", rather like the bars in music, and the rhythmic "beat" falls on the first syllable of this unit (the stressed syllables dictate where the foot boundaries fall). A tendency towards isochrony then asserts that the beats fall at more regular intervals than would be expected from the differing number of syllables in each foot, and this is because the length of the syllables becomes shorter as the number increase. American linguists are skeptical about this idea but our analyses of a corpus of English spoken for purposes of illustrating intonation revealed that such a tendency definitely exists. You'd think it was an easy enough question to resolve one way or the other, but if you think this you don't know linguists! :-)
There are several descriptions of the rhythm work we did. The most complete one, though very academic, is:
but there is a shorter version that summarises the actual research data:
HILL, D.R., WITTEN I.H. and Jassem, W. (1977) Some results from a preliminary study of British English speech rhythm which was presented at 94th. Meeting of the Acoustical Society of America, Miami, Dec 12-16 but only appears as a summary in the proceedings. The full text available as U of Calgary Computer Science Dept. Report 78/26/5
I could send you a draft electronic copy as I am currently working on putting a copy on the web but there's also a hard copy version published as a departmental report.
The intonation work is best accessed through Halliday's book though Craig Taube-Schock's thesis (for which he received the Governor General of Canada's Gold Medal) reports the initial experimental work we did to validate and extend Halliday's descriptions for purposes of computer speech intonation:
"Synthesizing intonation for computer speech output" Craig-Richard Taube-Schock. M.Sc. Thesis, Department of Computer Science, The University of Calgary 1993, 109 pages.
It is available from Proquest (who archive all university theses in North America) though they have the date as 1994. In implementing the intonation for the TextToSpeech kit, a number of improvements were made that are not written up in the thesis, especially the smoothing of contours.
From the original Developer TextToSpeech kit manual:
The Parser Module takes the text supplied by the client application (using the speakText: or speakStream: methods) and converts it into an equivalent phonetic representation. The input text is parsed, where possible, into sentences and tone groups. This subdivision is done primarily by examining the punctuation. Each word or number or symbol within a tone group is converted to a phoneme string which indicates how the word is to be pronounced. The pronunciation is retrieved from one of five pronunciation knowledge bases. The Parser must also deal with text entered in any of the special text modes. For example, a word may be marked in letter mode, which means the word is to spelled out a letter at a time, or in emphasis mode, which means the word is to receive special emphasis by lengthening it and altering its pitch. The Parser marks the phonetic representation appropriately in these cases.
...
The system attempts to speak the text as a person would. Punctuation is not pronounced, but is used as guide to pronounce the text it marks. For example, a period that marks the end of sentence is not pronounced, but does indicate that a pause occurs before proceeding to the next sentence.
A question mark at the end of a sentence caused the rising intonation of a question to be selected. Another special mode allowed punctuation to be spoken, rather than used to control how the text was spoken. I have put the whole manual on my university web site where it is easier to find than digging through the savannah repository, though it doesn't really address these issues completely (but is useful for many purposes, and you will find it useful background). Go to:
Select "Published papers" from the left-hand menu, scroll down to section "E. Other publications" and you'll find a whole lot of Gnuspeech-related documents there. The sixth item is "Manual for the original NeXT Developer TextToSpeech kit". Clicking the link witll allow you to download a .pdf file of the whole manual. The five previous links in that section are also useful references for Gnuspeech and will help you in your work on porting the server.
Many thanks for your willingness to get involved. Very much appreciated. Feel free to bug me with any questions/problems that come up.
HTH. All good wishes.
david --------- David Hill -------- The only function of economic forecasting is to make astrology look respectable. (J.K. Galbraith) --------
On Nov 4, 2009, at 6:21 PM, John Delaney wrote: Here I was trying to implement a speech synthesis API for a graduate musical synthesis class, and now I'm getting roped into actually working on the project. I'll implement some sort of Parameter class to hold the current intonation parameters, that should be pretty simple. Would it be possible for the synthesis engine to ramp up the intonation at the end of a sentence whenever there is a question mark? I don't think I have seen a synthesis engine, yet to do this, and it seems like such a small/easy thing to do. Perhaps I'll revisit this when I eventually take machine learning classes.
Thank you, John Delaney
On Wed, Nov 4, 2009 at 5:09 PM, Dalmazio Brisinda <dbrisinda@...> wrote: Yes, you are correct. All those server methods are yet to be implemented. Currently the server just supports speaking text with the defaults that were taken from Monet. This is certainly one area that could use some filling out, and any contribution would be more than welcome.
Best, Dalmazio
On 2009-11-04, at 5:54 PM, John Delaney wrote: Thank you all for your help. I have switched to using the server method because its very easy and functional. Am I mistaken, though that many of the parameters such as pitch and intonation have not yet been implemented to the server? I am looking at the server and all the get/set methods have return zero. I suppose I will need to impliment those if this is the case.
On Wed, Nov 4, 2009 at 12:37 PM, Dalmazio Brisinda <dbrisinda@...> wrote: Have a look at Linked Frameworks section in the Xcode Groups & Files pane. I've found in the past that for setting up the project on a different system, I've often had to remove the custom frameworks (Tube and GnuSpeech) and then add them again, so Xcode correctly picks up the new locations -- unless they're in standard system Framework folders. If you would like additional information on Xcode, have a look at the book "Xcode Unleashed" -- there may be others.
[snip]
---------
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
|

|
Re: GNUSpeech Console Utility
Hi John,
From memory: On Nov 4, 2009, at 8:42 PM, John Delaney wrote: One thing I've noticed is that the syntax in the speechserverprotocol is different from the notation used by the eventlist and monet, ie: notionalPitch Setting a base pitch pretonicRange How much pitch movement occurs *before* reaching the tonic foot in the tone group being processed -- see last email pretonicLift How much the pre-tonic feet are raised tonicRange tonicMovement How much pitch movement occurs in the tonic
These relate to the intonation contour as set up for Monet. You need a little understanding of the Halliday basis of the intonation are named differently than: speed; To do with the rate of speech elasticity; Obsolete parameter. intonation; Is intonation applied or not voiceType; Male/female/child pitchOffset; Difference in pitch from the default pitch volume; The volume and balance have the obvious meaning balance;
These relate to the set-up for the TextToSpeech Server You may find the Developer_TextToSpeech_Manual.pdf that I referenced in my last email helps you understand some of the Server parameters. Looking at the intonation control in Monet will help you understand the intonation contour parameters used in experiments with Monet and not really part of the Server adjustments as far as the user is concerned.
david
I'm not sure which variables correspond to what, so I'll take a good first stab and try to get things working for my purposes now. Thank you, John Delaney
On Wed, Nov 4, 2009 at 6:21 PM, John Delaney <a.pet.rock@...> wrote: Here I was trying to implement a speech synthesis API for a graduate musical synthesis class, and now I'm getting roped into actually working on the project. I'll implement some sort of Parameter class to hold the current intonation parameters, that should be pretty simple. Would it be possible for the synthesis engine to ramp up the intonation at the end of a sentence whenever there is a question mark? I don't think I have seen a synthesis engine, yet to do this, and it seems like such a small/easy thing to do. Perhaps I'll revisit this when I eventually take machine learning classes.
Thank you, John DelaneyOn Wed, Nov 4, 2009 at 5:09 PM, Dalmazio Brisinda <dbrisinda@...> wrote: Yes, you are correct. All those server methods are yet to be implemented. Currently the server just supports speaking text with the defaults that were taken from Monet. This is certainly one area that could use some filling out, and any contribution would be more than welcome.
Best, Dalmazio
On 2009-11-04, at 5:54 PM, John Delaney wrote: Thank you all for your help. I have switched to using the server method because its very easy and functional. Am I mistaken, though that many of the parameters such as pitch and intonation have not yet been implemented to the server? I am looking at the server and all the get/set methods have return zero. I suppose I will need to impliment those if this is the case.
On Wed, Nov 4, 2009 at 12:37 PM, Dalmazio Brisinda <dbrisinda@...> wrote: Have a look at Linked Frameworks section in the Xcode Groups & Files pane. I've found in the past that for setting up the project on a different system, I've often had to remove the custom frameworks (Tube and GnuSpeech) and then add them again, so Xcode correctly picks up the new locations -- unless they're in standard system Framework folders. If you would like additional information on Xcode, have a look at the book "Xcode Unleashed" -- there may be others.
Also, you might just want to confirm that when you open up the build options panel (CMD-i), select the Build tab, and some ways down you will see in the "Search Paths" section the option: Framework Paths, specifically, double-click and see that it's set to "$(SRCROOT)/.." with the "recursive" switch turned on. This is set for all the individual xcode projects. This expands out to show you the actual additional paths used to search and link against. $(SRCROOT) is configured on my system to point to "../.." in the General tab of the project info panel (CMD-i). Alternatively, you can just use hard paths in specifying the header/framework paths.
If you have the frameworks installed, then you might want to go to where they are (/Library/Frameworks/GnuSpeech.framework and /Library/Frameworks/Tube.framework) and rename them (say with .original appended), and then create links in /Library/Frameworks/ to the compiled frameworks in your project directory. You only really need to do this if you think you will be modifying or experimenting with the Tube and GnuSpeech frameworks. That way apps pick up the latest changes to these frameworks, instead of using the previously installed frameworks.
Also, if you have problems, try opening up the Tube and GnuSpeech xcode projects, and build them separately. Then go back and try and build the various apps etc. that depend on them. This in the event of some odd Xcode project configuration settings.
If you've made any other changes to the build settings, I suggest downloading another copy of the distribution as a backup "reference-only" copy so you can compare against it to see if your build settings for any given project match. Accidental or experimental changes to some of these settings may result in warnings or even errors during compilation/linking.
Best, Dalmazio
On 2009-11-04, at 12:06 PM, John Delaney wrote: Ok, found the diagnostic info. I have the most recent build of GnuSpeech, and the applications do produce speech. Also I have a question, I notice that in Xcode, rather than having the compiled frameworks, you have an xcode project of the framework. When I try to do that I get linking errors. How do you do include the uncompiled frameworks and then get them to compile? I've looked through all the settings and can't figure out the problem. Pardon my rookie questions.
On Wed, Nov 4, 2009 at 12:17 AM, Dalmazio Brisinda <dbrisinda@...> wrote: John,
Could you try installing the latest pre-compiled beta-release version and see if the various applications generate speech? Then we can hopefully rule out any sort of system configuration issue -- at least as far as the executable apps and libs are concerned.
Also, to get the backtrace of the segmentation fault, open Console, and show the Log List. Then scroll down and you should see under the headings DIAGNOSTIC INFORMATION -> User Diagnostic Information -> <the crash file> or FILES -> ~/Library/Logs -> CrashReporter -> <the crash file>
Best, Dalmazio
_______________________________________________ gnuspeech-contact mailing list
_______________________________________________
gnuspeech-contact mailing list
gnuspeech-contact@...
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact
|