Deep learning for daily care: medicine recognition and reminder systems for the visually impaired
, , , y
10 jun 2025
Acerca de este artículo
Publicado en línea: 10 jun 2025
Recibido: 18 mar 2025
DOI: https://doi.org/10.2478/ijssis-2025-0025
Palabras clave
© 2025 Uttam Waghmode et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Figure 1:

Figure 2:

Figure 3:

Figure 4:

Figure 5:

Figure 6:

Figure 7:

Figure 8:

Figure 9:

Figure 10:

Figure 11:

Figure 12:

Literature review
[ |
YOLO and OpenCV | Provides real-time object detection and visual replacement for the blind. | YOLO may struggle with small or distant objects, and OpenCV's accuracy can vary based on lighting conditions and object complexity. |
[ |
TensorFlow API, CNN, SSD, and MobileNet V2 | Achieves high accuracy without needing an Internet connection. | May require significant computational resources, especially for training the model. |
[ |
CNN | Utilizes data augmentation to achieve a 94% accuracy rate for banknote recognition. | Edge-detected images negatively affect accuracy, indicating a need for larger datasets and varied lighting conditions. |
[ |
CNN | Improves runtime and accuracy for heart rate estimation in large groups. | Specific details about the adapted algorithm and its implementation are needed for a thorough evaluation. |
[ |
YOLO and SSD | Uses Raspberry Pi devices for a compact travel aid, demonstrating real-world implementation. | The system may face limitations in detecting objects in complex environments or under varying lighting conditions. |
[ |
CNN and LSTM | Uses Braille and sound bite hearing devices for communication, achieving high accuracy. | The system's effectiveness may depend on the user's familiarity and comfort with Braille. |
[ |
FSP algorithm | Utilizes a panoramic camera for pedestrian trajectory prediction, improving real-time performance. | The algorithm's accuracy and performance in dynamic or crowded environments need further evaluation. |
[ |
CNN | Achieves 98% accuracy in diagnosing glaucoma using retinal images. | The effectiveness of the technique in clinical settings and its generalizability to diverse populations need further validation. |
[ |
Four-layered CNN | Detects and classifies objects with high accuracy and low response time. | The device's performance may vary based on the complexity of the environment and the types of objects present. |
[ |
CNN and fuzzy logic | Provides auditory feedback for obstacle detection, enhancing interaction with surroundings. | The computational complexity of the algorithms may affect real-time performance. |
[ |
CNN | Uses smart glasses to identify medicine, showing promise for real-world implementation. | The system's accuracy and reliability in identifying specific medications need further validation. |
Comparative study for the success rate of the proposed system with existing work
[ |
CNN | ∼2000 images | 92 | 3 | Smart glasses integration |
[ |
CNN + LSTM | ∼3000 images | 94 | 5 | Includes Braille output |
[ |
CNN | Augmented banknote data | 94 | Not medicine-specific | Focused on money recognition |
Inception CNN | 16,000 images | 4 | Audio output + medication reminder |
Success rate of the proposed system
1. | Strepsils | 46 | 4 | 92 |
2. | Volini gel | 48 | 2 | 96 |
3. | Dolo | 47 | 3 | 94 |
Success rate≥ | 94 |