Machine Learning
Rekognition
Find objects, people, text, scenes in images and videos using Machine Learning
Facial analysis and facial search to do user verification, people counting
Create a database of “familiar faces” or compare against celebrities
Use Cases:
Labeling
Content Moderation
Text Detection
Face Detection and Analysis (gender, age, emotions, etc.)
Face Search and Verification
Celebrity Recognition
Pathing (ex: for sports game analysis)
Rekognition – Content Moderation
Detect content that is inappropriate, unwanted, or offensive (image and videos)
Used in social media, broadcast media, advertising, and e-commerce situations to create a safer user experience
Set a Minimum Confidence Threshold for items that will be flagged
Flag sensitive content for manual review in Amazon Augmented AI (A2I)
Transcribe
Automatically convert speech to text
Uses a deep learning process called automatic speech recognition (ASR)
Automatically remove Personally Identifiable Information (PII) using Redaction
Supports Automatic Language Identification for multi-lingual audio
Use cases:
transcribe customer service calls
automate closed captioning and subtitling
generate metadata for media assets to create a fully searchable archive
Polly
Turn text into lifelike speech using deep learning
Allowing you to create applications that talk
Polly - Lexicon & SSML
Customize the pronunciation of words with Pronunciation lexicons
Stylized words: St3ph4ne => “Stephane”
Acronyms: AWS => “Amazon Web Services”
Upload the lexicons and use them in the SynthesizeSpeech operation
Generate speech from plain text or from documents marked up with Speech Synthesis Markup Language (SSML) – enables more customization
emphasizing specific words or phrases
using phonetic pronunciation
including breathing sounds, whispering
using the Newscaster speaking style
Translate
Language translation service
Can use to localize content:
translate websites and applications for international users
translate large volumes of text efficiently
Lex & Connect
Lex
Same technology that powers Alexa
Automatic Speech Recognition (ASR) to convert speech to text
Natural Language Understanding to recognize the intent of text, callers
Helps build chatbots, call center bots
Connect
Receive calls, create contact flows, cloud-based virtual contact center
Can integrate with other CRM systems or AWS
No upfront payments, 80% cheaper than traditional contact center solutions
Comprehend
For Natural Language Processing (NLP)
Fully managed and serverless service
Uses machine learning to find insights and relationships in text
Language of the text
Extracts key phrases, places, people, brands, or events
Understands how positive or negative the text is
Analyzes text using tokenization and parts of speech
Automatically organizes a collection of text files by topic
Sample use cases:
Analyze customer interactions (emails) to find what leads to a positive or negative experience
Create and groups articles by topics that Comprehend will uncover
Comprehend - Medical
Detects and returns useful information in unstructured clinical text:
Physician’s notes
Discharge summaries
Test results
Case notes
Uses NLP to detect Protected Health Information (PHI) – DetectPHI API
Store your documents in Amazon S3, analyze real-time data with Kinesis Data Firehose, or use Amazon Transcribe to transcribe patient narratives into text that can be analyzed by Amazon Comprehend Medical.
SageMaker
- Fully managed service for developers / data scientists to build ML models
Forecast
Fully managed service that uses ML to deliver highly accurate forecasts
- Example: predict the future sales of a raincoat
50% more accurate than looking at the data itself
Reduce forecasting time from months to hours
Use cases: Product Demand Planning, Financial Planning, Resource Planning, etc.
Kendra
Fully managed document search service powered by Machine Learning
Extract answers from within a document (text, pdf, HTML, PowerPoint, MS Word, FAQs…)
Natural language search capabilities
Learn from user interactions/feedback to promote preferred results (Incremental Learning)
Ability to manually fine-tune search results (importance of data, freshness, custom, …)
Personalize
Fully managed ML-service to build apps with real-time personalized recommendations
- Example: User bought gardening tools, provide recommendations on the next one to buy
Same technology used by Amazon.com
Integrates into existing websites, applications, SMS, email marketing systems, …
Implement in days, not months (you don’t need to build, train, and deploy ML solutions)
Use cases: retail stores, media and entertainment…
Textract
Automatically extracts text, handwriting, and data from any scanned documents using AI and ML
Extract data from forms and tables
Read and process any type of document (PDFs, images, …)
Machine Learning - Summary
Rekognition : face detection, labeling, celebrity recognition, content moderation,..
Transcribe: audio to text (ex: subtitles)
Polly: text to audio
Translate: translations from one language to another
Lex: build conversational bots – chatbots
Connect: cloud-based virtual contact center
Comprehend: natural language processing
SageMaker: machine learning for every developer and data scientist
Forecast: build highly accurate forecasts
Kendra: ML-powered document search engine
Personalize: real-time personalized recommendations
Textract: detect text and data in documents