Data Engineer Consultant for ASR || Farm Radio International
Welcome to our website fursatimes.com. In This Article, Are you looking for Data Engineer Consultant for ASR || Farm Radio International job opportunity announced.
Terms of Reference for a Data Engineer Consultant for ASR (Automatic Speech Recognition) under the LONGA Project
A. About Farm Radio International
Farm Radio International (FRI) is a Canadian non-profit organization using radio, mobile telephony and other digital tools and services to meet the information and communication rights – at scale – of rural and marginalized communities in sub-Saharan Africa. For rural families in Africa, access to information and communication, e.g. about agriculture, climate change, livelihoods, health and nutrition, are critical needs. FRI is committed to gender equality and inclusion as a cross-cutting priority as well as an end in itself. Our communication solutions are designed to respond to and transform gender relations and the participation of women, youth and marginalized groups at each stage of program design and delivery processes. Obtain more details from the Farm Radio website.
B. Background
With funding from the Wehubit program of ENABEL, Farm Radio International (FRI) is implementing the Longa – Automatic Speech Recognition (ASR) for African Languages (Longa) project. The aim of the project is to transform speech into actionable text with high accuracy and contextual awareness. The project is being developed by FRI with support from CGIAR, leveraging advanced machine learning and natural language processing techniques. For instance, Longa will be used to analyse and respond to the growing number of voice messages that FRI’s radio station partners receive from small-holder farmers through interactive radio programs on our Uliza interactive application. The ‘primary users’ of Longa are the radio station staff – radio presenters or producers, and FRI or other organisations who are gathering voice-based information from farmers/listeners. The ‘end-users’ are the people who listen to the radio programs – the project participants that FRI targets: men and women small-scale farmers. The anticipated outcomes of the project are as follows:
- Improve accuracy for Luganda language and accents recorded via phone call
- Replicate the ASR model for the Bambara language
- Integrate ASR application in FRI’s communication cloud platform for broadcasters
We are looking for a knowledgeable Data Engineer consultant with experience in speech recognition technology to improve the ASR system’s performance and dependability. This consultancy’s main goal is to offer professional advice and technical assistance for the creation, testing, and optimization of the ASR algorithms. To guarantee that the ASR system has the maximum possible levels of precision, effectiveness, and scalability, the consultant will collaborate closely with our development team.
C. Scope of Work
The consultant will be responsible for the following tasks:
- Data Management: Collect, clean, and preprocess audio and textual data to train and evaluate ASR models.
- Data Pipeline Development: Design and implement scalable data pipelines to automate the extraction, transformation, and loading (ETL) of speech data.
- Model Training: Work closely with the data science team to prepare data for training, validating, and testing ASR models.
- Performance Monitoring: Monitor and analyze the performance of ASR models, including accuracy, latency, and error rates. Identify and address any data-related issues.
- Integration: Collaborate with software developers to integrate ASR models into web applications, ensuring seamless functionality and optimal user experience.
- Documentation: Create and maintain documentation related to data workflows, model performance, and integration processes.
- Collaboration: Work with cross-functional teams including product manager, UX/UI designers, and data scientists to align data engineering efforts with overall product goals.
Read also:
D. Deliverables
Data Collection and Management:
- Data collection scripts and tools for gathering speech and text data.
- Clean and labeled datasets for training and evaluation purposes.
- Data storage solutions with appropriate indexing and retrieval mechanisms.
Data Pipeline Documentation:
- Design documents for data pipelines, including flow diagrams and architecture.
- Source code and configurations for ETL processes.
Model Preparation:
- Scripts and processes for preparing and augmenting data for ASR model training.
- Documentation on dataset splits (training, validation, test).
Model Performance Reports:
- Performance metrics reports for ASR models, including accuracy, precision, recall, and F1 scores.
- Analysis and troubleshooting reports on any data-related issues impacting model performance.
Integration Documentation:
- Integration guides and API documentation for incorporating ASR models into web applications.
- Code samples and best practices for interacting with ASR services from web clients.
User Feedback Analysis:
- Reports and insights from user feedback related to ASR accuracy and usability.
- Recommendations for improvements based on user feedback and performance data.
Codebase and Version Control:
- Version-controlled code repository for data processing scripts and pipeline implementations.
- Regular updates and commits to the codebase reflecting ongoing improvements and bug fixes.
Collaboration Reports:
- Documentation of cross-functional team meetings and decisions.
- Progress reports on collaborative projects involving ASR integration.
Testing and Quality Assurance:
- Test cases and results for ensuring data quality and model accuracy.
- Documentation of QA processes and issues encountered during testing.
Training and Support Materials:
- Training materials and documentation for team members on data handling, model integration, and troubleshooting.
E. Duration and Timeline
The consultancy will commence in October 2024 and conclude in August 2025 with a maximum level of effect (LoE) of 80 days. Specific deliverable deadlines will be agreed upon at the start of the consultancy.
F. Reporting
The consultant will report to the Project Manager Nathaniel Ofori and provide regular updates on progress. Weekly meetings will be held to discuss challenges, milestones, and next steps.
G. Elements of application
Interested applicants should submit a technical and financial proposal that includes the following:
- Qualifications and relevant experience of the firm/specific consultant that will be leading the assignment
- Clearly demonstrate experience in Modelling, Fine-tuning, and Deployment of Custom ASR Models
- The general approach that the consultant(s) will take to accomplish the assignment
- Specific tasks/actions that the consultant will undertake to complete the deliverables
- The timeline and work plan
- Proposed budget
How to apply:
H. Application procedure
Diversity and inclusivity are important to FRI, and we encourage applications from anybody who fits the requirements. Applications are highly encouraged from women, individuals with disabilities, and members of other equity-seeking or marginalized populations. In all areas of the hiring process, reasonable accommodations are offered upon request.
Additionally, FRI takes part in the Misconduct Disclosure Scheme, an interagency initiative (https://misconduct-disclosure-scheme.org/). Because of this, when we recruit someone, we will ask their prior employers for information regarding any findings of sexual exploitation, abuse, or harassment during employment, as well as any issues that were under investigation at the time of the applicant’s leave. The job applicant certifies that they have read and understand our recruitment procedures by submitting an application.
Please send your proposals by email to jobapps@farmradio.org before September 24, 2024, if you would want to apply for this position. In the topic line, state: Longa is an ASR data engineer consultant.