[CLOSED] [APP][4.1+][V1.3.7] Saiy | Your Open Source Virtual Assistant

mahootzki · Apr 12, 2012

As they say, you don't get a second chance for a first impression...
While I got an email for your response, I was playing with the app Rom Toolbox and thought, there is an app you can take some pointers from.
It's not the prettiest app, but he somehow was able to do the ICS swipe to switch between tabs and so forth.
I would tell you to check it out if you can.

Sent from a..... Who cares, it's here now!!!

brandall · Apr 12, 2012

Wifi Instruction

Hi folks, I've very nearly finished the first tab and it will be to control switching wifi on and off.

Nothing ground breaking, but there's quite a bit of logic underneath it which I can build on for other functions and therefore needs testing...

Your mission (should you choose to accept it), is to try and break it! Let's see how the logic stands up when you attempt to confuse it...

Overview
As demonstrated in the YouTube video, I have commands that when recognised lead to possible sub-commands.

I'm going to initially stick to this method as it greatly improves accuracy. The downside however, is it immediately moves away from 'natural speech'.

To change the wifi state, you have to FIRST say, 'Android System' - when this is detected, the sub-commands become available, of which the wifi controller is one.

Then say whatever natural wording you like that contains "on" or "off" and "wifi".

In addition, just to test out RoboJoe's suggestion of wanting utter! to ONLY perform the requested task with no further interaction, I've added logic to look for the Americanism 'period' at the end of a command.

So to clarify:

"Android system, turn on wifi, period"

Android system = search in System sub-commands
on wifi = wifi on
off wifi = wifi off
period = no further suggestion/interaction

Hope that all makes sense...

I'm going to initially post the apk at the end of this thread, as it contains some of the code that the Config Tab didn't like - I hope I've corrected it, but I don't want it on the Play Store until I've confirmed!

I'll be back in a couple of hours!

EDIT: I was having a slow moment - I cannot of course use 'period' as some (all?) voice engines use that to denote a full stop... Period has now been changed to 'zip' - as in 'zip-it'...

brandall · Apr 12, 2012

mahootzki said:
As they say, you don't get a second chance for a first impression...

Absolutely...

While I got an email for your response, I was playing with the app Rom Toolbox and thought, there is an app you can take some pointers from.
It's not the prettiest app, but he somehow was able to do the ICS swipe to switch between tabs and so forth.
I would tell you to check it out if you can.

I'll check it out, thank you - I need all the help I can get with it!

ironlood · Apr 12, 2012

I like the conversation of setting the alarm. It would be awesome if after all the long dialog it ask

Utter: would you like to save this alarm set up

Me: yes I would

Utter: what would you like to name it

Me: work week

Utter: okay I've saved your alarms under the tag work week

The next time you set alarms you would say

Me: utter please set alarms for work week

Utter: okay I've recalled your preciously saved tag work week and all your alarms are set... Would you like to modify any of the alarms?

Pow, right in the childhood

99problemz · Apr 12, 2012

ironlood said:
I like the conversation of setting the alarm. It would be awesome if after all the long dialog it ask

Utter: would you like to save this alarm set up

Me: yes I would

Utter: what would you like to name it

Me: work week

Utter: okay I've saved your alarms under the tag work week

The next time you set alarms you would say

Me: utter please set alarms for work week

Utter: okay I've recalled your preciously saved tag work week and all your alarms are set... Would you like to modify any of the alarms?

Pow, right in the childhood

Some futuristic freakiness going on there but me likey me likey

Sent from my Galaxy Nexus using Tapatalk 2

brandall · Apr 13, 2012

ironlood said:
I like the conversation of setting the alarm. It would be awesome if after all the long dialog it ask

Utter: would you like to save this alarm set up

Me: yes I would

Utter: what would you like to name it

Me: work week

Utter: okay I've saved your alarms under the tag work week

The next time you set alarms you would say

Me: utter please set alarms for work week

Utter: okay I've recalled your preciously saved tag work week and all your alarms are set... Would you like to modify any of the alarms?

Me likey likey too! I'll have to come up with a way to establish when something could be 'repeated' or a 'template' of kinds... Great idea though

Thanks +1

freak4dell · Apr 13, 2012

Brandall...I just have one thing to say regarding saving settings. If you do allow for that, please make it easy for users to backup those settings and restore them. It doesn't have to be automated through the app, but at least manually through a file we can just copy. For people who flash ROMs a lot or have to erase their SD card for whatever reason, it's really annoying to have to setup all the configuration again on some apps, especially ones that have a multitude of settings like utter! eventually will.

brandall · Apr 13, 2012

freak4dell said:
Brandall...I just have one thing to say regarding saving settings. If you do allow for that, please make it easy for users to backup those settings and restore them. It doesn't have to be automated through the app, but at least manually through a file we can just copy. For people who flash ROMs a lot or have to erase their SD card for whatever reason, it's really annoying to have to setup all the configuration again on some apps, especially ones that have a multitude of settings like utter! eventually will.

Damn right - It will definitely be in there as it's something I find very frustrating...

For rooted devices they'll be some additional ROM and Kernel features and detection when they are changed and updated.

brandall · Apr 13, 2012

Version 1.4A

Hi folks, attached is the version with the WiFi tab enabled. I've rushed the intro speech a little as I'm tired... but posted anyway.

Let me know that it doesn't force close please (!) and see if you can beat the logic!

I'm going to grab a few hours sleep - if all is well when I return, I'll upload to the first page and Play Store.

Thanks in advance

EDIT: There is a bug as I couldn't drop the WiFi receiver, but it shouldn't cause any issue. I'll fix later.

EDIT2: REMOVED due to bug detecting WiFi state. Fixing...

vcallas · Apr 13, 2012

Looks Promising.

I had no trouble getting nutter to recognize and execute the command to turn off Wifi though I could not get it to turn Wifi back on. It always told me that Wifi was already on. Did notice at times I paused to long between saying "Android System" and issuing the next command so nutter only recognized the "Android System" and not what followed.

Altered the words I said between "Android System" and the Wifi command and as long as I did not have to long of a pause recognized my intention for Wifi, even if it would not turn back on.

Did forget to include the zip command frequently, just was not natural.

ICS KANG build 31 on Samsung Captivate.

freak4dell · Apr 13, 2012

Okay...did some brief testing.

It recognizes the commands, and executes them properly. I did notice that if I didn't speak clearly, it would think that I told it both "on" and "off", but that's not really your fault. I tend to speak in a mumble sometimes.

I'm assuming that there is no further interaction programmed in, even though it tells me that it would like to interact further.

My constructive criticism:

1. I know this is probably something that will come later, but just in case, I'd like to hear less talking. Right now, utter! basically repeats what I asked (not in the exact words, but it says "I detected the string blah blah blah and I will blah blah blah"). My opinion is that it would be better if it just said something like, "WiFi has been enabled," or, "Okay, I've turned on WiFi for you."

2. I do like the idea of a keyword to make sure it just does the task it's asked to and nothing more. I'm glad I listened to the intro all the way through, because otherwise I would have sat there saying "period" instead of "zip".

3. Again, I think you mentioned this as something that was to be done later, but it would be nice not to have to say "Android System". Ideally, it would be nice to not have to say any sort of pre-string at all. Basically, once I hit the button, I should be able to say "Turn on WiFi," and that's that. Once the voice trigger is implemented, I should be able to just say "Utter!, turn on WiFi," without hitting any buttons at all (but yes, I know that's way down the line).

Please know, I'm not a developer, so I don't know the coding difficulty of the stuff I'm suggesting. Please don't take my comments as demands, but rather, just suggestions to be done if possible.

Lastly, this may just be me doing something wrong, but my phone has a crisis whenever I want to have utter! turn on WiFi. I don't have a data plan right now (don't feel like paying for GPRS data on this island). But, it appears that Voice Search always needs a data connection, because if WiFi is turned off and I trigger Voice Search, it just says it can't connect and won't do anything further. So, I can't actually ask utter! to turn on WiFi, because I need a connection to do so in the first place. Am I an idiot that's doing something wrong, or is this an inherent flaw in Google's Voice Search app?

99problemz · Apr 13, 2012

I may be wrong on this but I believe with any voice search app you will need to have some sort of data connection mobile data or wifi. In your case since you only have wifi then it should work when it's enabled. Disabling it will only give you the connection server error.

Edit: Tested two other voice apps with data and wifi off and it's a no go. Just my 2¢ on the matter.

Sent from my Galaxy Nexus using Tapatalk 2

mistermentality · Apr 13, 2012

99problemz said:
I may be wrong on this but I believe with any voice search app you will need to have some sort of data connection mobile data or wifi. In your case since you only have wifi then it should work when it's enabled. Disabling it will only give you the connection server error.

Edit: Tested two other voice apps with data and wifi off and it's a no go. Just my 2¢ on the matter.

Sent from my Galaxy Nexus using Tapatalk 2

That's correct, most voice recognition software uses google which sends the data to their servers to be decoded then back to your device.

I'm not sure there is any free offline speech recognition engine that could be used but if there was utter would have a big speed advantage over other apps including Siri which sends its data to apples servers.

Dave

Sent from my LG P920 using Tapatalk

mistermentality · Apr 13, 2012

Regarding the post on not having to say android system in order to access wifi, it would be possible for the app to have preset phrases such as "turn wifi on" saved and if it finds such phrase act as if the full command had been issued.

For example if someone says "turn on wifi" or just "wifi on" it could quickly check a small built in database or even external flat text file and see that they mean something specific and act accordingly.

This would not create a noticeable delay as long as only common commands were pre stored.

Dave

Sent from my LG P920 using Tapatalk

freak4dell · Apr 13, 2012

99problemz said:
I may be wrong on this but I believe with any voice search app you will need to have some sort of data connection mobile data or wifi. In your case since you only have wifi then it should work when it's enabled. Disabling it will only give you the connection server error.

Edit: Tested two other voice apps with data and wifi off and it's a no go. Just my 2¢ on the matter.

Sent from my Galaxy Nexus using Tapatalk 2

mistermentality said:
That's correct, most voice recognition software uses google which sends the data to their servers to be decoded then back to your device.

I'm not sure there is any free offline speech recognition engine that could be used but if there was utter would have a big speed advantage over other apps including Siri which sends its data to apples servers.

Dave

Sent from my LG P920 using Tapatalk

Okay, that's what I thought, and I find this utterly annoying. I can understand that for a lot of commands, it's easier to do the processing server side, but there's no excuse for not having simple things just processed on the phone. In this particular case, it's not even Voice Search that's processing the command...it's utter! Voice Search should simply pass the data onto utter! and let utter! whatever utter! wants to do with it. For this particular task, since it's encoded into the program, utter could handle it without a data connection at all, yet it's being held at the mercy of Voice Search.

Absolutely stupid.

/rant.

Futur Innovations · Apr 13, 2012

Absolutely no issues for me

I was even able save the sound to the sdcard and set them as notifications

Sent from my Galaxy Nexus using Tapatalk 2

codesplice · Apr 13, 2012

freak4dell said:
Okay, that's what I thought, and I find this utterly annoying. I can understand that for a lot of commands, it's easier to do the processing server side, but there's no excuse for not having simple things just processed on the phone. In this particular case, it's not even Voice Search that's processing the command...it's utter! Voice Search should simply pass the data onto utter! and let utter! whatever utter! wants to do with it. For this particular task, since it's encoded into the program, utter could handle it without a data connection at all, yet it's being held at the mercy of Voice Search.

Absolutely stupid.

/rant.

The commands wouldn't be processed by Voice Search, but converting your speech into text is handled by Google's Voice Search engine. The actual processing of the raw audio data is done on Google's servers. Then the text commands are sent back to the device and then utter will process those commands. The alternative is to code a complete offline speech processing engine, which would be quite an undertaking. I'm not sure there are any pre-packaged systems that do it, but I suppose one could compile a build of PocketSphynx for Android: http://cmusphinx.sourceforge.net/2011/05/building-pocketsphinx-on-android/

brandall · Apr 13, 2012

vcallas said:
I had no trouble getting nutter to recognize and execute the command to turn off Wifi though I could not get it to turn Wifi back on. It always told me that Wifi was already on. Did notice at times I paused to long between saying "Android System" and issuing the next command so nutter only recognized the "Android System" and not what followed.

Altered the words I said between "Android System" and the Wifi command and as long as I did not have to long of a pause recognized my intention for Wifi, even if it would not turn back on.

Did forget to include the zip command frequently, just was not natural.

Thank you - made me spot a bug in the WiFi receiver. Fixed now and new apk at the bottom of this post.

I can add a setting to increase the 'pause' level before the recogniser assumes you've stopped speaking.

'Zip' isn't naturally - agreed. Just wanted to test the theory of it. Anything that moves away from natural speech isn't a good thing, but not being able to stop utter! from trying to be helpful could be worse... I need to ponder over how to approach this in the future.

freak4dell said:
Okay...did some brief testing.

It recognizes the commands, and executes them properly. I did notice that if I didn't speak clearly, it would think that I told it both "on" and "off", but that's not really your fault. I tend to speak in a mumble sometimes.

I'm assuming that there is no further interaction programmed in, even though it tells me that it would like to interact further.

No further logic as yet.

My constructive criticism:

1. I know this is probably something that will come later, but just in case, I'd like to hear less talking. Right now, utter! basically repeats what I asked (not in the exact words, but it says "I detected the string blah blah blah and I will blah blah blah"). My opinion is that it would be better if it just said something like, "WiFi has been enabled," or, "Okay, I've turned on WiFi for you."

I have to assume new users will find this novel, however I understand this is getting annoying otherwise - Imagine how many times I've heard it! Right now I want as much explanation in there as possible, just in

utter - utter! won't be the same.

2. I do like the idea of a keyword to make sure it just does the task it's asked to and nothing more. I'm glad I listened to the intro all the way through, because otherwise I would have sat there saying "period" instead of "zip".

Ha! I edited that post when I released 'period' placed a full stop...

3. Again, I think you mentioned this as something that was to be done later, but it would be nice not to have to say "Android System". Ideally, it would be nice to not have to say any sort of pre-string at all. Basically, once I hit the button, I should be able to say "Turn on WiFi," and that's that. Once the voice trigger is implemented, I should be able to just say "Utter!, turn on WiFi," without hitting any buttons at all (but yes, I know that's way down the line).

Please know, I'm not a developer, so I don't know the coding difficulty of the stuff I'm suggesting. Please don't take my comments as demands, but rather, just suggestions to be done if possible.

Your feedback is helpful and appreciated. I'll write a bit about the 'Android System' style sub-commands below

Lastly, this may just be me doing something wrong, but my phone has a crisis whenever I want to have utter! turn on WiFi. I don't have a data plan right now (don't feel like paying for GPRS data on this island). But, it appears that Voice Search always needs a data connection, because if WiFi is turned off and I trigger Voice Search, it just says it can't connect and won't do anything further. So, I can't actually ask utter! to turn on WiFi, because I need a connection to do so in the first place. Am I an idiot that's doing something wrong, or is this an inherent flaw in Google's Voice Search app?

It's not you, it needs a data connection. I'll write below about Sphinx

mistermentality said:
That's correct, most voice recognition software uses google which sends the data to their servers to be decoded then back to your device.

I'm not sure there is any free offline speech recognition engine that could be used but if there was utter would have a big speed advantage over other apps including Siri which sends its data to apples servers.

99problemz said:
I may be wrong on this but I believe with any voice search app you will need to have some sort of data connection mobile data or wifi. In your case since you only have wifi then it should work when it's enabled. Disabling it will only give you the connection server error.

Edit: Tested two other voice apps with data and wifi off and it's a no go. Just my 2¢ on the matter.

Sent from my Galaxy Nexus using Tapatalk 2

mistermentality said:
Regarding the post on not having to say android system in order to access wifi, it would be possible for the app to have preset phrases such as "turn wifi on" saved and if it finds such phrase act as if the full command had been issued.

For example if someone says "turn on wifi" or just "wifi on" it could quickly check a small built in database or even external flat text file and see that they mean something specific and act accordingly.

This would not create a noticeable delay as long as only common commands were pre stored.

freak4dell said:
Okay, that's what I thought, and I find this utterly annoying. I can understand that for a lot of commands, it's easier to do the processing server side, but there's no excuse for not having simple things just processed on the phone. In this particular case, it's not even Voice Search that's processing the command...it's utter! Voice Search should simply pass the data onto utter! and let utter! whatever utter! wants to do with it. For this particular task, since it's encoded into the program, utter could handle it without a data connection at all, yet it's being held at the mercy of Voice Search.

Absolutely stupid.

/rant.

Futur Innovations said:
Absolutely no issues for me I was even able save the sound to the sdcard and set them as notifications

Good work!

codesplice said:
The commands wouldn't be processed by Voice Search, but converting your speech into text is handled by Google's Voice Search engine. The actual processing of the raw audio data is done on Google's servers. Then the text commands are sent back to the device and then utter will process those commands. The alternative is to code a complete offline speech processing engine, which would be quite an undertaking. I'm not sure there are any pre-packaged systems that do it, but I suppose one could compile a build of PocketSphynx for Android: http://cmusphinx.sourceforge.net/2011/05/building-pocketsphinx-on-android/

I would love to integrate Sphinx, but there is limited documentation and it's a time thing too... I REALLY want to have a go at it though... This YouTube video demonstrates how it could be used.

I also want to implement voice typing, but without having to select a separate IME everytime... Again, it's a time thing...

Regarding the key-phrases such as 'Android System'. Right now, this isn't necessary as there are a limited number of key-phrases to search through. I could easily change the apk below to work with just 'turn on wifi'.

The problem will come when there are 50 different in-built key-phrases, with the potential for unlimited user-created sub-commands and then the same again with user-created key-phrases and their related sub-commands.

The phrases will be handled in a database and I'm pretty good at sql, so I hope I can make it as quick as possible, but right now I've no idea how quick that could be.

Rather than needing to revert to key-phrases set-up later if there are speed issues, I'm building the database so I can simply remove their requirement if there's no problem - I'm mindful of it...

Anyway, updated apk below with the wifi receiver fixed. Hope it's fool-proof now!

EDIT: APK REMOVED DUE TO WIFI PERMISSION BUG

vcallas · Apr 13, 2012

Just tried the update and now is working without issue.

I am amazed by how responsive you are to all of the posts on here.

Sent from my SGH-I897 using XDA

freak4dell · Apr 13, 2012

codesplice said:
The commands wouldn't be processed by Voice Search, but converting your speech into text is handled by Google's Voice Search engine. The actual processing of the raw audio data is done on Google's servers. Then the text commands are sent back to the device and then utter will process those commands. The alternative is to code a complete offline speech processing engine, which would be quite an undertaking. I'm not sure there are any pre-packaged systems that do it, but I suppose one could compile a build of PocketSphynx for Android: http://cmusphinx.sourceforge.net/2011/05/building-pocketsphinx-on-android/

Ah, that makes sense. That's my mistake. I figured utter! was doing the processing of it all.

I still think that 100% net-dependent voice search is stupid (modern phones should have the power to handle at least some simple tasks without the internet), but I can understand why more complex tasks require server-side processing. Anyway, I'll stop here, since this isn't really utter! related.

brandall, like I said, when I suggest things, I'm doing it from a "what would be convenient for me, and maybe other general users" perspective. I appreciate your explanations on some of the things I brought up, and I think you're doing the right thing by spending time on the core functionality of utter! first and putting things like Sphinx off until later.

I really do want to commend you for the way you are handling this. Some would say that development is going slow, but it's very clear that you are dedicating a lot to this, and rather than pushing out half-assed stuff, you're taking your time to make sure that each update is high quality. You're also making sure to cover all your bases the first time around. I may not know much about developing, but that habit makes the future of everything easier.

[CLOSED] [APP][4.1+][V1.3.7] Saiy | Your Open Source Virtual Assistant

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Senior Member

Similar threads

Top Liked Posts