[CLOSED] [APP][4.1+][V1.3.7] Saiy | Your Open Source Virtual Assistant

Status
Not open for further replies.
Search This thread

krzych

Senior Member
Jul 29, 2009
1,141
301
Warsaw
As far as speed goes....In comparison to the multitudes of other voice assisted apps that I have tried, Utter moves very quickly. If you think it can be improved that would be monumental. The only time that I notice much lag depends on internet connection. Obviously an offline engine would be the fastest, but I can't imagine what it would take to develop.:fingers-crossed::fingers-crossed:
I never said it's slow - the opposite - it's very fast. I just said that when you define an app to run, it scans every possible activity, and then this scan takes long :). Nothing more :)

And you didn't see my backup app yet, so you don't know the speed yet ;) heheh ;)
 

GLDavis

Senior Member
Sep 25, 2012
998
845
Virginia Beach
I never said it's slow - the opposite - it's very fast. I just said that when you define an app to run, it scans every possible activity, and then this scan takes long :). Nothing more :)

And you didn't see my backup app yet, so you don't know the speed yet ;) heheh ;)

Ah...hah, okay. Yes it's pretty speedy.

Waiting then for the "backup app"...another goody to play with!:laugh:
 

brandall

Senior Member
May 16, 2008
2,679
5,222
Birmingham
Hi folks, sorry for my absence - finally emerged from the Play Store bugs and think I've pretty much got them. In between times I've been working on some new features too, some of which I think you'll like :D

I'll need to let you folks have a tinker with it before I upload to the Play Store - really hoping within the next 24 hours!
Yeah, you can open apps and make calendar entries, but it is pretty basic. You have to speak the exact words to get it to start (which I guess is the same for utter as of now). But I'm confident that you'll make it less robotic to do so, and you have plenty of time until the next Android OS (and that's even if Google does release their voice assistant then).
Good to know even with their resource they aren't leaving me behind! When you say 'voice assistant' you make it sound like another app apart from Google Now ? Is that what you mean?
I'm curious because I'm nosey; what is the highest AI level Utter will report at present?

Mine did say 1.04 until a recent patch when it dropped to 1.035. I don't use Tasker but imagine that integrating with Tasker would increase the reported AI level significantly...?

I'm not asking because I'm worried that I'm missing anything at present; I know things are going to get a whole lot more intelligent some day soon. :)

ps. Can you make it so that a FB status update or email body/subject begins with a capital letter? :)

Also, I'm struggling with some punctuation:

"Question mark" works
"Exclamation mark" works
"Comma" works

"Full stop" doesn't work ( I haven't tested this but... please don't make me say "period" :p )
"dash" or "hyphen" doesn't work

And can we have a 'new line' or 'carriage return' command?

Also, (sorry for bombarding) does the voice recognition improve as we use it more? (I'm using ICS if that matters)

Sent from my ARCHOS 101 G9 16GB Turbo using XDA app
Thanks for all of these suggestions and yes to them all! I get rid of the punctuation etc when I'm searching for matches, so I need to add it back in after. I'll have a ponder about new paragraph etc too. Also, in two releases time there will be an option to increase the pause timeout, so you don't have to worry so much about taking a long breath! I'll add full stop in too - 'period' is just not very English!

The AI level isn't working properly, thanks for making me double check! I can't actually remember what the highest level possible would be on the last release, as mine is updated now and I'd have to recompile, load everything etc... When I fix it, I'll you know how you hit 'power user' status.

Google Voice Search improves the more you use it - although it also gets worse too! Returns off whenever I say on and puts text and call at the beginning of so many sentences for no reason at all.... I'm having to increasingly reverse-algorithm these. Another couple of solutions are coming soon too :D
I was a little curious myself about the AI level. I was at 1.04 and dropped to 1.03. I use tasker so I would think think that would up my score?:eek:
It's supposed to! :eek:
Hi Brandall do you want my help on translating whole app to Portuguese (Brazil)?
Thanks for the offer and yes please! Not right now though... It's a huge task for me to prepare the list of what needs to be translated and in what context. I won't be too long though. Thanks again for the offer. :good:
As far as speed goes....In comparison to the multitudes of other voice assisted apps that I have tried, Utter moves very quickly. If you think it can be improved that would be monumental. The only time that I notice much lag depends on internet connection. Obviously an offline engine would be the fastest, but I can't imagine what it would take to develop.:fingers-crossed::fingers-crossed:
:fingers-crossed: too! I'll build it in somehow... As far as speed goes, every command is lightening quick for me, on test devices and emulator, so if anyone does have a command that is slow to process, please do bring this to my attention!
My AI level is 1.0 and I feel real dumb. I've tried to increase AI level by linking my Tasker with utter!, but I can't link Tasker. It says "There was error communicating with Tasker"

Also I can't enable Shake-to-Wake, Wave-to-Wake, and Visual Results. I don't know if Tasker isn't understanding my 'enable' or it doesn't have this feature.

I can't use 'spell' it doesn't recognize this word. When I try to use 'define', it says this feature is not there yet.
The Tasker issue is a known Android problem where an application does not recognise the permission to use it, if it's installed after the application wanting permission! Not sure if it's fixed in Jelly Bean? You'd have to uninstall utter! and then re-install it to get Tasker to find the permissions. The bug is stared a lot of times from my post here and I've asked the dev of Tasker to put a work-around in for this.

There is the option to enable shake and wave in the settings menu. Does it work then? The command to turn it on is 'enable visual results' - does it not work? 'Define' isn't quite ready yet - need to build in Wikipedia (and some others) support. Solution for it not recognising 'spell' coming in the next release!
[SNIP]The only thing I have not gotten to work is the Wave to Wake feature.
Still haven't worked out how I can demonstrate the 'rhythm' other than a very odd YouTube video!
Will be right here soon enough, finishing, polishing, changing UI, and voila! ;)
I'll look forward to checking it out! :victory:

Thanks once again for all feedback, suggestions and reports folks!

I think you're going to like the next release :D
 

brandall

Senior Member
May 16, 2008
2,679
5,222
Birmingham
hey brandall, absolutely loving the app, you're doing a fantastic job with it.

there's been a couple of situations recently where i've wanted to sent a message while driving and it's worked perfectly right up to the point of proof reading the message but after that does it hand control back to the user or is there a way utter can ask again whether it's ok to send the message?

also, if you have a pin lock on your phone is there a way after turning the screen on that you can speak the pin to unlock your phone?

keep up the great work.
Sorry, I missed this post - Auto-sending should now be enabled? Is it not working for you?

On the subject of text-messages, so users have reported that utter! doesn't read the last message, but an older one. The only reason I can think of that this would be happening is if another application is intercepting the message first and then not playing fair by releasing it to other apps afterwards.

For pin lock, it could only be for rooted devices. I hadn't considered building a voice-to-unlock lock-screen of my own, but probably best to wait until offline recognition is available!!! :D

In the next release, you will be able to do it using Tasker, the secure settings plugin and utter! If there's no data connection, you can still rely on your good old fashioned fingers! They are so 1980's... :D

EDIT: For back page arrivals, the latest test build is actually on the Play Store here
 

99problemz

Senior Member
May 13, 2011
174
54
Wohooo!! been waiting to talk dirty to kendra :p just wanted to say B. Utter has come a long way and I'm glad you have included us on this fantastic voyage keep it coming :thumbup:
 
  • Like
Reactions: brandall

krzych

Senior Member
Jul 29, 2009
1,141
301
Warsaw
I'll look forward to checking it out! :victory:
Heheh, you too? :) Ok, you'll be first to notify :). I think I can do it in one week or so, as it really works already :)

Thanks once again for all feedback, suggestions and reports folks!

I think you're going to like the next release :D
There was no release we wouldn't like :D. I won't participate in the next 24hrs for sure, as I need winter tires, and this requires a trip, and be back in one day :).

Btw. who has the snow already? ;P We have it, and it's not cool :D
 
  • Like
Reactions: brandall

GLDavis

Senior Member
Sep 25, 2012
998
845
Virginia Beach
It doesn't ask me to send the message either. It hands over the application back to the user once you accept the proofread option and requires a manual send. :eek:
If you choose not to proofread then it sends the message as it should. :)

Sent from my PC36100 using xda app-developers app
 
  • Like
Reactions: brandall

IAmAN00bie

Senior Member
Dec 2, 2010
629
335
Yep, there's been rumors for a long time that Google is working on a full-fledged voice assistant. Google Voice Search is slowly creeping into that territory, and I have no doubt that eventually it will become just that.
 
  • Like
Reactions: brandall

krzych

Senior Member
Jul 29, 2009
1,141
301
Warsaw
Yep, there's been rumors for a long time that Google is working on a full-fledged voice assistant. Google Voice Search is slowly creeping into that territory, and I have no doubt that eventually it will become just that.
Google won't make it. I mean, they will, but it will be same crap as the market (vel. play) and similar. Google is good only with new ideas, but of course, they have the development power... if only a smart, conscious person would lead them... :) (and wouldn't look for money so much :p)
 
  • Like
Reactions: brandall

nerys71

Senior Member
May 13, 2008
2,107
422
will there be a way to run utter "live" from the lock screen? so I can just speak the "activate" word and then give a command without having to touch the phone? anyway to run it live "ALL" the time whenever its plugged into power or something like that?

I can just say "computer" it chimes and then give my command?
 
  • Like
Reactions: brandall

ironlood

Senior Member
Jan 31, 2010
614
116
Antioch
Google won't make it. I mean, they will, but it will be same crap as the market (vel. play) and similar. Google is good only with new ideas, but of course, they have the development power... if only a smart, conscious person would lead them... :) (and wouldn't look for money so much :p)

Google has done a lot with the TTS and Voice recognition realm... they bought a company that specialized in TTS video game voice overs... from a text file it would generate almost perfect speech synthesis and in an extremely small package... Voice recognition is a whole different beast... for starters they had a phone number you could call (1-800-GOOG-411) which stated that your calls may be monitored or recorded for future use, Google records the conversations and uses them for recognition . As an example of how now extensive this is, the United States has a population of around 300 million people and ill say that only 50% knew the number existed and called it let's say that they want to know the weather for San Francisco, they would say to the phone weather San Francisco, this comment was recorded... that's 150 million different people saying San Francisco, this gets logged and now there are 150 different pronunciations of San Francisco...

Sent from my SGH-T999 using Tapatalk 2
 
  • Like
Reactions: brandall

rogerbrowne99

Member
Oct 20, 2008
10
12
Slough
will there be a way to run utter "live" from the lock screen? so I can just speak the "activate" word and then give a command without having to touch the phone? anyway to run it live "ALL" the time whenever its plugged into power or something like that?

I can just say "computer" it chimes and then give my command?

Listening mode would be great but I suspect you would need to be connected to a power source to make proper use of it

Except when driving, I still think the shake mode is a good compromise.

At my lock screen I just give it a shake and I can make calls, put on the led torch, find the weather etc,

I don't even need to enter my password to get into my phone.

I give it a Tasker command ("the password")

The task that runs is just a simple keyguard unlock then home command.

This gets past the password lock but I have a Tasker condition to put the keyguard back on after the screen
is off for five minutes.

Does anyone have anymore tips?
 
Last edited:

brandall

Senior Member
May 16, 2008
2,679
5,222
Birmingham
It doesn't ask me to send the message either. It hands over the application back to the user once you accept the proofread option and requires a manual send. :eek:
If you choose not to proofread then it sends the message as it should. :)
I need to revisit the 'proof reading' options:

Facebook - utter! will read out loud the message and ask if it's ok
Twtter - proof read sends you to a web box you can press send from ** In possibly the most backwards thought out decision of the year, or ever, Twitter disabled their API being able to populate the tweet to check into their Android app, saying that it encouraged auto-generated content - However, the API does allow you to just send it without the user checking at all!?!?! ... :cyclops: ***
Email/Text - Sends it to the available app - Once sent to the app, utter! loses control.

I think the option is to add all possible proof reading options for all of the above. I'll put it on the list!
Yep, there's been rumors for a long time that Google is working on a full-fledged voice assistant. Google Voice Search is slowly creeping into that territory, and I have no doubt that eventually it will become just that.
Google won't make it. I mean, they will, but it will be same crap as the market (vel. play) and similar. Google is good only with new ideas, but of course, they have the development power... if only a smart, conscious person would lead them... :) (and wouldn't look for money so much :p)
I didn't know about such rumours... My concerns that Google will release something that makes utter! pointless are lessening all of the time. With every customisation included the possibility of the user doing something wrong and giving them a negative experience increase - I don't think Google will want to risk stepping into this zone.
will there be a way to run utter "live" from the lock screen? so I can just speak the "activate" word and then give a command without having to touch the phone? anyway to run it live "ALL" the time whenever its plugged into power or something like that? I can just say "computer" it chimes and then give my command?
Listening mode would be great but I suspect you would need to be connected to a power source to make proper use of it
Absolutely, at present the only option for a wake-up-phrase kills the battery - it would have to be available only when the device was plugged in. Unless Google sort out their offline recognition for us!
Except when driving, I still think the shake mode is a good compromise.

At my lock screen I just give it a shake and I can make calls, put on the led torch, find the weather etc,

I don't even need to enter my password to get into my phone.

I give it a Tasker command ("the password")

The task that runs is just a simple keyguard unlock then home command.

This gets past the password lock but I have a Tasker condition to put the keyguard back on after the screen
is off for five minutes.

Does anyone have anymore tips?
Liking the use of the unlock task! Is that with secure settings, or can Tasker do this alone now? Any chance you could post the basic task for others to have a look at please?

On the subject of Tasker, I need to ask a favour please: Could someone running Jelly Bean back-up their Tasker set-up and then uninstall it. Reboot, reinstall it and see if utter! can execute tasks, or if the permissions to do this are not recognised? I'm just wondering if this has been fixed in Jelly Bean. Thanks meter +1 waiting! Waiting for the dev of Tasker to come back to me about why my plugin is crashing. Hopefully soon!
Google has done a lot with the TTS and Voice recognition realm... they bought a company that specialized in TTS video game voice overs... from a text file it would generate almost perfect speech synthesis and in an extremely small package... Voice recognition is a whole different beast... for starters they had a phone number you could call (1-800-GOOG-411) which stated that your calls may be monitored or recorded for future use, Google records the conversations and uses them for recognition . As an example of how now extensive this is, the United States has a population of around 300 million people and ill say that only 50% knew the number existed and called it let's say that they want to know the weather for San Francisco, they would say to the phone weather San Francisco, this comment was recorded... that's 150 million different people saying San Francisco, this gets logged and now there are 150 different pronunciations of San Francisco...
Interesting read - I had no idea they'd done this, I bet it got them a lot of useful data! I've no doubt they will end up with the most sophisticated system out there; it's just a matter of time..

Run into a few frustrating headaches on the current test build, going to keep going until I drop, to try and get it out for you to play with....
 

nerys71

Senior Member
May 13, 2008
2,107
422
how does s voice do it from the lock screen?

can you make your program invoke a proper widget? this way i can "add" the widget to the lock screen. hit power tap the icon done. right now the only way to get a "command icon" is from within your program. but there is no way to "MOVE" that icon to the widget locker.
 

brandall

Senior Member
May 16, 2008
2,679
5,222
Birmingham
how does s voice do it from the lock screen?

can you make your program invoke a proper widget? this way i can "add" the widget to the lock screen. hit power tap the icon done. right now the only way to get a "command icon" is from within your program. but there is no way to "MOVE" that icon to the widget locker.
S-Voice is built into the framework, so will have 'root' permissions.

You can create a custom icon using many shortcut creating apps - It'll need to launch the activity LauncherShortcutActivity. In some apps you'll need the package name followed by the class name:

com.brandall.nutter
com.brandall.nutter.LauncherShortcutActivity.class (you may or may not need to add 'dot class' at the end.

A homescreen widget is on my list to do....
 

krzych

Senior Member
Jul 29, 2009
1,141
301
Warsaw
Google has done a lot with the TTS and Voice recognition realm... they bought a company that specialized in TTS video game voice overs... from a text file it would generate almost perfect speech synthesis and in an extremely small package... Voice recognition is a whole different beast... for starters they had a phone number you could call (1-800-GOOG-411) which stated that your calls may be monitored or recorded for future use, Google records the conversations and uses them for recognition . As an example of how now extensive this is, the United States has a population of around 300 million people and ill say that only 50% knew the number existed and called it let's say that they want to know the weather for San Francisco, they would say to the phone weather San Francisco, this comment was recorded... that's 150 million different people saying San Francisco, this gets logged and now there are 150 different pronunciations of San Francisco...

Sent from my SGH-T999 using Tapatalk 2
I do not know much about speech recognition, but we all know, they are trying hard. About TTS, I can say only this: if polish students can do better just having few people working at it, then Google sucks ;). Of course, it's not how it exactly looks, but I can assure you that Google could do better than Ivona, but they just do things like always, just to make customers pay ;) (virtually, by ads and so on)

I didn't know about such rumours... My concerns that Google will release something that makes utter! pointless are lessening all of the time. With every customisation included the possibility of the user doing something wrong and giving them a negative experience increase - I don't think Google will want to risk stepping into this zone.
No worries. They will try as always, but they will abandon this as always commenting like "little demand" or similar. The revolution will start by the big companies only, and only when the neural-net's like things will go public. But this is whole bloody story which will change all the world... let's hope not like in Terminator ;D hahahah

A homescreen widget is on my list to do....
Very good idea. Will be even better when connected to "continuous-recording" on tap :). And yes, I know it's far future ;).
 
  • Like
Reactions: brandall

GLDavis

Senior Member
Sep 25, 2012
998
845
Virginia Beach
Interesting read about the lock screen. I use a pattern lock so I decided to test this. So with pattern lock enabled i shake and voila, Utter responds.... Cool. I can make a call with no problems. I can open apps, do a Google search BUT cannot see the results OR access the app until I unlock the phone. I like that! Just wondering if you lost your phone how hard would it be to get past a locked phone?


Sent from my PC36100 using xda app-developers app
 
  • Like
Reactions: brandall

CarsnGadgets

Senior Member
Sep 1, 2010
295
97
Weymouth
Redmi Note 10 Pro
Tasker has an issue with jelly bean and ICS pattern locks still, the pin and password locks are turn-off-able, but the pattern lock keeps coming back, at least on Samsung phones and the Nexus 7. Its been reported on every forum I have looked at relating to tasker and lockscreen.
 
  • Like
Reactions: brandall
Status
Not open for further replies.

Top Liked Posts

  • There are no posts matching your filters.
  • 695
    Welcome to Saiy... Install - Mod edit: Broken DL link removed.

    For those of you visiting this thread as subscribers to utter! you'll know the history well. A promising release, active development and then silence..... Please accept my apologies - if you don't know that 'sometimes life gets in the way' then you are the envy of most.

    A bit of history for you...

    To cut a half-decade-long story short, the Fragmentation of Android ground this project to a halt. When I first demoed my creation to the world, I had visions of knocking it up on Android and then focusing on how it functioned in the background. I was about to drown myself in machine learning to bring my vision to life. Job done? Well no...

    It turned out that developing an app that covered almost every function available on an Android device (I refuse to say 'phone' - it's not 1983!) , was a job for 1,000+ developers, not just a lonesome one such as I, on a 10 year old Dell laptop - and each time a new update to Android was released, I huddled in a corner and wept, as I waited for the crash reports and 1* ratings to roll in.

    It turns out that bugs aren't only specific to Android versions. You multiply that by Manufacturers messing with builds - and devices running multiple versions and then even the Locale of the device causing crashes and you end up with 12,000+ supported devices exponentially multiplied by all other eventualities as your user base.

    I drowned... And my (in hindsight) naive plans of master AI'ery, whilst users enjoyed playing with it on Android, dropped down to the bottom of my to-do list. Things had to change.

    I decided to shut myself away in a dark room to completely rewrite the Android code, so that it was both readable and scalable; despite its complexity. Rumours flew that I had died - and in some ways, I did....

    Not really ^ that just felt justifiably dramatic! :cyclops:

    So, utter! is reborn as Saiy® and Open Sourced, so it may have a chance of competing with the big boys, before they run away with all of our private data and souls, in order to use their services...

    Install Saiy from the Play Store - here

    Note - a direct download link will appear here shortly!

    You can get involved by checking out the Development Section in the app, or alternatively, if you're a curious Android Developer, check out the base code published on GitHub here

    The code base is pretty large, so briefly, there are two major classes in the app, that direct and distribute work elsewhere:

    SelfAware is the main Foreground Service, responsible for managing the application state and channelling voice recognition, text to speech and other API requests.

    Quantum is the main processing class, where commands are locally resolved (if required), sensibility checked and actioned.

    Understanding the above two classes is essential to following the flow of the full application logic.

    MyLog is a global verbose logging toggle. When enabled, the output will flow class to class, as well as display durations for time sensitive functions.

    The following remains for the sake of me needing some sleep and posterity :cowboy:

    About this thread

    Firstly, I’d like to thank everyone for the positive feedback and encouragement and the huge amount of messages I’ve received on YouTube, email, twitter, G+ and here on XDA volunteering to be involved in beta testing. It’s very much appreciated and I apologise that I cannot respond to them all. Please take this intro as a thank you.

    This thread is for your open discussion about utter! and the features you'd like to see included, so please feel free to throw your ideas back and forth (be nice to each other) and I’ll do my best to keep up with them when I have time.

    About utter!

    Unlike other voice applications, utter! will be configurable to the user, enabling you to assign spoken keywords to use the functionality of your favourite applications or make system related changes to your device. There's no cumbersome overlay or launching of another application to use the features, utter! sits in the background ready to be activated, whenever you want, without interrupting whatever you are doing.


    Which applications will initially be compatible?

    The more interest I can demonstrate in utter! the more likely your favourite application developer will want to make their functionality available to use. The purpose of the YouTube video and this thread is to get their attention and create a more functional first beta release.

    I’m a developer and I'd like features of my application to be utter! compatible.

    Please contact me to discuss how our applications can work together.
    119
    FAQs

    Q) Will an offline speech engine be built in?
    A) I hope so. I'm waiting to see what features of Google Now are made available to developers

    Q) What languages will it be available in?
    A) At first, English only. Once I have the framework functioning correctly, I can turn my attention to translations (thank you for the messages I’ve received offering translation help).

    Q) Will it use natural speech recognition?
    A) Over time it will, but in the testing stages commands will be more structured. As my algorithms develop, so will the application's ability to recognise exactly what you want.

    For the conversation mode I’m really up against it. I’m almost trying to reinvent the wheel knowing that Google are no doubt sitting on a very advanced algorithm purposely for this… They are more than welcome to allow me to use it…

    Q) How much is utter! going to cost when it’s out of beta?
    A) I don’t know as yet. Not more than a couple of dollars... I just need to make sure that whatever the price, it's more than worth it.

    Q) Which speech engine does it use?
    A) In the video I used IVONA beta (available on the Market here). This option is configurable so you can use a free or premium engine of your choice.

    Q) Google’s Project Majel will no doubt surpass this application. Why are you bothering?
    A) Perhaps.. It remains to be seen the direction Google take and whether their focus will be too much in the interest of nudging you towards Google services, rather than providing an open and configurable voice integrated assistant.

    For example: If you assign ‘Save Battery’ to a command, on detection utter! could go ahead and minimise your brightness level and screen time-out, turn off (or restrict) all data connections, set your device to GSM only, turn off vibrate functions and screen animations, underclock and undervolt your CPU (requires root) etc etc.. Is that what you expect from Majel? Personally, I don’t… [Update - I think I was right about this!]

    Q) How do I register to beta test!?
    A) Hang around this thread – thank you.

    Q) The icon you used in the video for utter! was lame!
    A) Yes! I just borrowed the inbuilt icon for now. If you think you can design a better one, please feel free! Maximum respect (at the very least) from the first post is offered in return!

    Q) Can you adapt Siri to do these things please?
    A) I honestly have had these requests – I’m afraid that’s not going to be possible now… or in the near/far future /ever…

    ----

    By genisis7
    attachment.php



    By goander
    Utter.jpg



    By joshaw
    attachment.php
    attachment.php



    By usaff22
    Capture.png
    101
    utter! release progress


    • FACEBOOK
    • TWITTER
    • CONTACTS
    • TASKER
    • BLUETOOTH
    • WIFI
    • MOBILE DATA
    • WORLD TIME
    • WORLD WEATHER
    • ROOT FUNCTIONS

    Credits


    pingpongboss - amazing StandOut library!
    usaff22 - amazing icon and artists impression work
    meadowsjared - Sharing his coding skills
    nobnut - previously unknown generosity
    waydownsouth - previously unknown patience and sharing of knowledge
    fahadayaz - Bug solving GEEK

    Permissions

    Code:
    All permissions are for device based command purposes. NONE of your personal data is uploaded or shared to any external server of any kind

    Change Log

    Code:
    V2.2.5A RELEASED
    
    Email
    Text
    Navigation
    Facebook
    Twitter
    Directory Searches
    Car Locator
    Play Music
    Visual Results
    Wave-to-wake
    
    + many more features added! Please see the command list in the app for details.
    Changed to foreground application with permanent notification to stop Android killing it!
    
    V2.2.4A RELEASED
    Skype fixed
    FC's fixed on some commands
    Speed increases
    Added troubleshooting menu
    Changed icon display
    Enabled background test code (hidden).
    
    V2.2.1A RELEASED
    Code and UI revamp.
    Converted to pre-beta background app
    Usage details in the application.
    
    V2.1.9.1A RELEASED
    
    MASSIVE CHANGES!
    Simply too many to list... 
    All details in the app
    
    V2.1.0A RELEASED
    
    Mobile data
    Twitter
    Facebook
    Contacts (algorithm test)
    
    Dropped 2.1 compatibility
    Fahrenheit added to weather
    Initialisation tweaked
    Custom listener tweaked
    Button labels and Loquendo sample now family friendly :eek:
    HUGE code rebuild
    
    V2.0.1A RELEASED
    
    Tasker integration!
    World Weather
    Custom Listener test
    Long-press-search integration
    Loads of bug fixes and code improvements.
    Fixed Weather and Time force closes on 2.1 & 2.2 devices
    
    V1.9A RELEASED
    Root-functions fixed
    Tablet compatibility fixed
    Errors when no recogniser fixed
    Loads of bug fixes and code improvements.
    
    V1.7A RELEASED
    Root-functions included!
    FIXED - Recogniser button errors
    Loads and loads of bug fixes and code improvements.
    
    V1.6A RELEASED
    World-Time included
    FIXED 'unknown' Bluetooth state message.
    Loads and loads of bug fixes and code improvements.
    
    V1.5A RELEASED
    Bluetooth voice control test included
    FIXED the V1.4A 4.0.3 ICS crash
    FIXED FC on back button from config tab
    FIXED FC when closing app
    FIXED FC for Galaxy Nexus TTS settings
    FIXED leaked Receiver
    Loads and loads of bug fixes and code improvements.
    
    V1.4A RELEASED
    WiFi voice control test included
    Loads of bug fixes and code improvements.
    
    V1.3A - RELEASED
    Fixed FC on Config Tab
    
    V1.2A - RELEASED
    Release version 'jumped' to match Play Store
    Totally rewritten UI code
    Totally rewritten engine logic
    Prevented override of localised English voice
    Added test contact loader
    Intro changed to audio file
    Option to record output to sdcard for translation help
    So much else that I've forgotten...
    
    V0.0.1A - RELEASED
    Long presses for association are not functional yet

    BUGS
    Code:
    IVONA registers itself in error, even if it may actually work. A full uninstall and reinstall of the IVONA files is required.
    Buttons don't reactivate occasionally after utterance - 'utterance' code depreciated.
    Weather and Time API's are useless for USA State searches. Need to change provider.

    utter! stable version is available from Google Play here.

    Saiy stable version is available from Google Play here.

    Latest test releases can be found in my more recent posts
    31
    Please release an Alpha version!!! PLEASE!!!

    JP
    Thanks for all of the comments and support folks. For all of those eager to test something I'm going to knock up a configuration apk over the weekend. Nothing too exciting I'm afraid, but digesting the comments I've realised I really need to initially focus on the following:

    1) Getting the very basic framework out so if you'd be so kind to test on your various devices I can catch early device, Android version, custom ROM/kernel bugs and get them right from the start. I need to make sure that if the correct voice data etc isn't already installed on your device I handle it correctly, depending on your location and other things.

    2) Configure the app to be multilingual from the start - If I leave this until a later date, I know I'll end up putting it off with it coming second to bugs and fixes and enhancements etc. Thank you for all of the messages offering translation support, it's really appreciated.

    3) A few of you have mentioned your accents not working well with other speech apps. This got me thinking that I need to be able to allow you to configure the raw voice data, rather than just keywords. So, for example, if you say 'load app cut the rope' and the voice data returns 'low dap cudder hope' (or something else just as random), then I need to be able to allow you to view the raw voice data and if necessary assign 'low dap cudder hope' to open cut the rope! That way it will be accent issue free for many of the functions.

    So, I'll work on a simple test apk over the weekend then you can all have a mess with your raw voice data and let me know your findings depending on your language etc. Thanks in advance.

    4) Build in a simple (at first) offline speech engine, that can be improved version by version. I'm looking at integrating Sphinx, but it's not going to be easy as there is limited documentation.

    5) Create an option that can immediately change utter! to text interaction only, rather than having to abort what you are doing and adjust this in the preferences. So, a 'mute' button of sorts.

    -------

    Again, thank you for all of the suggestions, I do scan through them when I have time and it's really helped me prioritise what I should be doing to make utter! more useful to you.

    Please keep the suggestions coming and perhaps let me know your thoughts on the following:

    a) What voice engines do you use? I want to be able to refer users to a link on the market (or direct download) for a suggested engine. I know this is a personal preference, but if for a certain language there seems to be majority agreement, then I'll go with that.

    b) Do I need to allow the default search engine to be configurable? Does anyone not use Google...?!

    c) What are your experiences of other voice applications? What are your personal essential features that they include and what are they missing?

    -----------

    Funding Update

    I've spoken to many potential partners and a very long story short, they each have their own commercial agenda, all of which I believe detract from the biggest appeal of utter!

    I want to be able to focus on utter! allowing you to use your own personal favourite applications, because that is the experience you have chosen, one which I want to enhance. I do not want to 'team up' with a limited number of selected commercial partners to provide your results in the hope you will then use their services.

    So, how can I achieve this?

    I'm going to go down the kickstarter route to raise as much funding as I can for the application. Briefly, every time I think of a function I wish to include, I think of a hundred eventualities and adjoining functions, I simply cannot do this to its full potential alone, so the funding will go towards the cost of outsourcing the Java development to assist me to build the application to the highest standards and quickly (without cutting corners)!

    Most importantly, I can focus on building the application based on its appeal, rather than a steered commercial angle.

    At the point the money I raise runs out, I will have the amount of downloads and popularity (hopefully) of the application to entice commercial partners and go from there.

    Having assessed all of my options, I think this is the best way forward all-round.

    Once I've written up the kickstarter brief (so much to do!), I'll post the link here. Just to clarify, you are not in any way expected to donate and never would be. If you do donate, you will of course have my gratitude and that of others. When the link appears, you all have the power of social media at your fingertips, so if you could share it on twitter, G+ etc that would be absolutely fantastic...

    Right, I best stop waffling and get coding!

    brandall
    26
    Tasker Plugin

    Please note - This thread will be updated specifically for Saiy very soon - I've been busy! I hope you can follow along, replacing utter! with Saiy in the mean time..... Do make sure you download the correct XML files at the bottom of this thread.

    Tasker Plugin Tutorial

    At present, the plugin has three main features:

    1) Speak
    2) Notify
    3) Pass variable values.

    Please download and import the attached utter! example project, to make it easier to follow the processes explained below. As with all tasks you haven’t created yourself, it’s a good idea to review the content within the XML on the sdcard, prior to importing them, as they could contain rouge actions. On this occasion, it might be ok to trust me though :)

    From the Tasker preferences, make sure beginner mode is not ticked and under the miscellaneous tab, tick ‘Allow External Access’ so the two applications can communicate.

    If you don’t see the imported utter! project, on the main Tasker screen, slide your finger downwards from the top of the screen (there is a faint white downwards pointing arrow), which will reveal the submenus.

    Make sure Tasker is turned on!

    Speak & Pass Variable Values

    The Profile smsReceivedU is activated when a text is received from anyone and triggers the task smsU. Click smsU to open it.

    You’ll firstly see a variable set action that is creating the voice content you want to pass to utter! The content consists of the message details. The second action is the Plugin configuration – tap on the action and then click edit.

    In the text box, you’ll see the content that you are going to ask utter! to speak. It can consist of just plain text, just variables, or a combination of both. For the sake of example, I used both. Save out of the plugin and go back to the task. Press the ‘play button’ at the bottom right of the task to ‘test’ it.

    At this point, Tasker may alert you that it will need to start monitoring your messages in order for the task to work correctly, accept the confirmation. When you press play, you hopefully will hear the information, including the populated variable content announced by utter! If you don’t, you may need to send yourself a quick text to populate the variables.

    * Note: occasionally, the activation of monitors (such as incoming texts) can be delayed. Saving all the way out of Tasker and opening it again seems to resolve this. Send yourself another text afterwards to confirm.

    When the profile smsReceivedU is ticked as active, your text messages will be read aloud when they are received. However, there may be situations when you want to choose whether you hear them or not.

    Head to the profile smsRecievedU – Interactive and open the triggered task smsU – Interactive. In action 1, you’ll see the content you are going to request utter! says and it contains a question about whether or not you want to hear the received message (the trigger for this task). Action #2 sets the message data that utter! will announce, if you confirm that you’d like to hear it. Click on the plugin action (#3) and press edit – You’ll see that the box ‘Send Value Only’ is ticked and the variable %smsdata (the message data) is entered to be sent to utter!

    * Note: There is currently no way for an external application to access the values of your Tasker variables (without a hack) due to privacy issues, as some of the data within the variables may be personal. As this is the case, you’ll need to select ‘Send Value Only’ in any tasks where you would like utter! to be made aware of a change in a variable value. These values are stored separately and securely by utter!

    Heading back to the task, action #4 is another plugin action, this time requesting that utter! immediately speaks the content of %tosay (configured in action #1). You’ll see that the ‘Start Listening’ box is ticked as you’ll need to answer the question!

    Now, when utter! announces ‘You’ve received a text message. Would you like me to read it?’ you’ll be aware that replying ‘yes, please read the message that is contained in the tasker variable I sent to you’ will result in utter! not knowing what on Earth you’re talking about! We need to configure a custom phrase within utter! to handle this.

    Heading into utter! select the customisation tab and click ‘Custom Phrase’. This is going to be handling our reply when utter! asks the question above. Enter ‘please read it’ in the phrase field (top box) and in the response box (bottom box!) %smsdata (case sensitive) and press ‘Create’.

    The result is, that when utter! asks ‘You’ve received a text message. Would you like me to read it?’, you can respond with ‘please read it’ and the custom phrase handler will detect the response is a variable name. The variable name will be substituted in the response for the current variable value (you sent it to utter! in action #3 of the task smsU – Interactive) and hey presto, your text message will be read out!

    For completeness, you may wish to add another custom phrase, where the phrase is ‘not right now thanks’ and the response is ‘okay, well let me know when you do’ – just to give it a little more personalisation… a bit nicer than just saying ‘cancel’.

    There are two more tasks in the project - drivingOnU and drivingOffU. These task turn the profile smsRecievedU – Interactive, on and off. You may wish to activate the Tasker profile 'hands-free' whilst driving, so from the Customisation Tab, click on 'Create Commands' and select 'Run Tasker Task'. Click on 'drivingOnU' and when prompted by utter! set the command phrase to something like 'I'm driving', the success words to 'drive safely' and the fail words to 'something went wrong!'

    Whenever you wish to activate the profile and have your messages announced, you can say 'I'm driving' and the profile will become active. Do the same again for the 'drivingOffU' task and you'll be able to toggle them by voice whenever you need to.

    To summarise, any variable values you want to keep utter! updated on so you can request the current value by voice, pass them in a ‘Send Value Only’ plugin action. Don’t forget, in the ‘Speak’ plugin action, utter! accepts plain text and variables in any order or combination.

    Notify

    Stipulating the interaction level of utter! (speak uninvited whenever / speak uninvited if I’m not at work etc) is coming soon, but in the mean time there may be variable content within your tasks that you’d like to be made aware of, but only when you’re good and ready. For this reason, you can connect the speech data to a notification, that when clicked, will begin to announce the content. This is handled by utter! rather than the Tasker ‘notification click’ action.

    The profile unlockedU is a quick example of how this works. The trigger is the device being unlocked (so you can easily test it out) and it fires the task infoU. The task infoU places some random device info into the variable %anyname in action #1. The utter! plugin action #2 sets the content for the notification (%anyname) and the box is ticked to confirm the Notification action.

    Press the ‘play’ button and you should see a notification appear. Clicking on it will begin the speech of the content placed in %anyname.

    To avoid cluttering the notification bar, currently only one notification is available. If another is activated, it will replace the content of the first one. This will eventually be configurable.

    Tasker, Llama & External App Support

    Users have requested to be able to switch the application off and on under certain circumstances and so I've exposed the ability to do this and other functions using intents from external applications. The applications must allow you to add 'data extras' to intents and you can now do the following:

    1 - Start the voice recognition
    2 - Start the permanent voice recognition
    3 - Stop the permanent voice recognition
    4 - Turn utter! off completely
    5 - Flush the memory utter! is using
    6 - Toggle driving mode
    7 - Toggle caller announcement
    8 - Toggle notification announcement

    I've created a separate Activity to do this. Normally you have to list the package name, followed by the class name.

    Package name - com.brandall.nutter
    Class name - com.brandall.nutter.EIH

    To stop any rogue applications firing off commands to utter! you need to go to the Power User Settings and create an alphanumeric password. That password will need to be entered in the data extras of the intent, to confirm you have 'authorised' it.

    Here is a screenshot of how it would look in Tasker. The command:2 refers to the permanent recognition above. If you put command:4 it would turn utter! off. The password:mypassword123 is the password of mypassword123 you set in the Power User Settings.

    attachment.php


    Each action must be labelled and immediately followed by a colon: before the content.

    command:
    password:

    I hope the above made sense?!

    ------------------------------------------------------------------------------------------------------------------------------------------------------
    NEW! - ADVANCED TASKER RECEIVER PROJECT

    Note: I've been informed that this project does not import correctly on the Gingerbread version of Tasker. I'm investigating.

    Before you read any further, this is for the more advanced Tasker users, who want to handle voice commands with their own custom Tasks. If you're new to Tasker, perhaps return to this post at a later date and get started with the project detailed above!

    For those that are still with me, import the Tasker Receiver Project below and then you'll need to tick the Send to Tasker box in the Try Again options (in the utter! Advanced Settings). Once you've done that, all unknown commands will be broadcast to Tasker to handle and the project you've imported will resolve them, or not....

    First thing to remember is that if utter! knows to send the voice data to Tasker, then it will not speak a response, such as 'I didn't understand that request' or something equally as irritating! It'll be up to you to handle any responses in Tasker - including when the voice data doesn't trigger a Task you've created.

    In the project there is a Master Command List, which at present contains the three commands, sausages, beans and chips. These are populated into an Array, which for all intents and purposes, is a list..... Every time Tasker receives voice data from utter! it will refresh this list, to make sure it picks up on any new commands you've created.

    It then gets a bit tricky - The voice data is also in the form of an Array, so the Task 'Voice Data Receiver' loops through each item in the voice data Array, whilst also looping (in a nested loop) through each item in your command Array. Basic pattern matching takes place on each command, so that if the voice data in any place contains the word 'sausages', it will trigger a match. This is using the Tasker regex of *sausages*. You can of course use more advanced regex should you wish.

    If a match is made, the Task looks for the index of the command number and that will then trigger the corresponding Task called 'CommandAction#', where the number at the end is populated by the index variable in the loop. The voice data is handed to this Task as a parameter, so you can do further analysis to look for more matching words, should you wish. An example would be a trigger command of 'display' and a further check for 'off' or 'on' in the voice data. The CommandAction# Tasks finish by performing an action (of course!) and providing the vocal response to utter!

    If no match is made, the negative voice response is set at the bottom of the Voice Data Receiver Task.

    There are notes placed inside the Tasks, which I hope will help to make the process a little more clear for you, as I confused myself writing the above :cyclops:

    If you get stuck, there are plenty of users and threads here on XDA that will come to your rescue. Check my signature below for a couple of links.