The Healthcare Translation Web App is a web-based prototype designed to enable real-time, multilingual translation between patients and healthcare providers. This application converts spoken input into text, provides a live transcript, and offers a translated version with audio playback. The project showcases technical proficiency, development speed, and the effective use of generative AI for efficient healthcare communication.
- Voice-to-Text with Generative AI: Converts spoken input into a text transcript, enhancing transcription accuracy, particularly for medical terms.
- Real-Time Translation and Audio Playback: Provides real-time translation of the transcript with a "Speak" button for audio playback.
- Mobile-First Design: Ensures responsiveness and optimization for both mobile and desktop devices.
The project utilizes modern web technologies and AI tools:
- Frontend: Vite, React, TypeScript
- AI APIs: Gemini AI (for text translation), Web Speech Recognition API (for voice-to-text conversion)
- Styling: CSS
- Deployment: Vercel
The project is structured as follows:
components.json
index.html
package.json
README.md
src/
App.tsx
components/
translation-app.tsx
ui/
button.tsx
card.tsx
global.css
lib/
utils.ts
main.tsx
vite-env.d.ts
tsconfig.app.json
tsconfig.json
tsconfig.node.json
vite.config.ts
- App.tsx: The main component of the application that includes the header, main content and footer.
- translation-app.tsx: The main component of the translation application that handles speech recognition, translation and speech synthesis logic.
- button.tsx: Reusable button component.
- card.tsx: Reusable card component.
- utils.ts: cn function to combine CSS classes.
- The global.css file contains the global styles of the application, including Tailwind CSS settings and custom CSS variables.
- Node.js & npm installed
- Clone the repository:
git clone https://github.com/your-repo/healthcare-translation-app.git
- Navigate to the project directory:
cd healthcare-translation-app - Install dependencies:
npm install
- Start the development server:
npm run dev
- Open the application in your browser at
http://localhost:5173.
The application integrates with the following APIs:
- Gemini AI API for translation and text processing.
- Web Speech Recognition API for voice-to-text conversion.
- Fork the project.
- Create a new branch:
git checkout -b feature/nueva-funcionalidad
- Implement your changes and commit:
git commit -am 'Añadir nueva funcionalidad' - Push the changes to your branch:
git push origin feature/nueva-funcionalidad
- Open a Pull Request.
- Data Privacy: Ensure secure handling of patient conversations to maintain confidentiality.
- API Security: Use authentication and rate-limiting mechanisms where applicable.
- Error Handling: Implement robust error handling for API failures.
This project is licensed under the MIT License.
Author(s): Jaider Salas Burgos