ObjectToSpeech

An application for recognizing objects in real time and describing their name and location in the frame by voice

Exemplary messages printed by the application:

I see a person at the top right corner.
I see 2 objects: a chair at the bottom and a person at left.
I see 3 objects: a keyboard at right, a tv at top and a cup at the bottom left corner.

Hardware used in the project:

Raspberry Pi 4
Pi Camera v2

Models used for the object detection:

MobileNet SSD v2 Lite
MobileNet SSD v1

Object datasets used for training:

Common Objects in Context (COCO)
Custom dataset with Rubics Cubes
Custom dataset with people wearing face masks
Custom dataset with pictures of dogs and cats

Text to speech libraries:

gTTs (when connection to the Internet is established)
pyttsx3 (when being offline)

The Application creates the sentences based on the golden ratio division of the frame:

To install required packages run the following command:

pip3 install -r requirements.txt

Finally to launch the application run:

python3 App.py

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
SSD_MobileNet		SSD_MobileNet
__pycache__		__pycache__
App.py		App.py
README.md		README.md
requirements.txt		requirements.txt
settings.py		settings.py
utilities.py		utilities.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ObjectToSpeech

An application for recognizing objects in real time and describing their name and location in the frame by voice

Exemplary messages printed by the application:

Hardware used in the project:

Models used for the object detection:

Object datasets used for training:

Text to speech libraries:

The Application creates the sentences based on the golden ratio division of the frame:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ObjectToSpeech

An application for recognizing objects in real time and describing their name and location in the frame by voice

Exemplary messages printed by the application:

Hardware used in the project:

Models used for the object detection:

Object datasets used for training:

Text to speech libraries:

The Application creates the sentences based on the golden ratio division of the frame:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages