Search

Information Technology_USA - USA_Developer

Real Soft, Inc.
locationJacksonville, FL, USA
PublishedPublished: 6/29/2026
Full time
: MAX CONFIRMED
Location: ONSITE- Fully Onsite Wilmington, DE , Houston TX, Jersey City, NJ , 5 days WFO
Duration: 6 months

Role: AI ML Software Engineer

Descriptions:

Job responsibilities
• Work closely with product managers, data scientists, ML engineers, and other stakeholders to understand requirements and prioritize use cases.
• Design, develop, and deploy state-of-the-art AI/ML/LLM/GenAI solutions to meet business objectives.
• Develop and maintain automated pipelines for model deployment, ensuring scalability, reliability, and efficiency.
• Implement optimization strategies to fine-tune generative models for specific NLP use cases, ensuring high-quality outputs in summarization and text generation.
• Conduct thorough evaluations of generative models (e.g., GPT-4.1), iterate on model architectures, and implement improvements to enhance overall performance in NLP applications.
• Implement monitoring mechanisms to track model performance in real-time and ensure model reliability.
• Communicate AI/ML/LLM/GenAI capabilities and results to both technical and non-technical audiences.
• Stay informed about the latest trends and advancements in the latest AI/ML/LLM/GenAI research, implement cutting-edge techniques, and leverage external APIs for enhanced functionality.

Required qualifications, capabilities, and skills
• Bachelor's or Master's degree in Computer Science, Engineering, or a related field
• 6-9 years of demonstrated experience in applied AI/ML engineering, with a track record of developing and deploying business critical machine learning models in production.
• Proficiency in programming languages like Python for model development, experimentation, and integration with OpenAI API.
• Experience with machine learning frameworks, libraries, and APIs, such as TensorFlow, PyTorch, Scikit-learn, and OpenAI API.
• Experience with cloud computing platforms (e.g., AWS, Azure, or Google Cloud Platform), containerization technologies (e.g., Docker and Kubernetes), and microservices design, implementation, and performance optimization.
• Solid understanding of fundamentals of statistics, machine learning (e.g., classification, regression, time series, deep learning, reinforcement learning), and generative model architectures, particularly GANs, VAEs.
• Ability to identify and address AI/ML/LLM/GenAI challenges, implement optimizations and fine-tune models for optimal performance in NLP applications.
• Strong collaboration skills to work effectively with cross-functional teams, communicate complex concepts, and contribute to interdisciplinary projects.
• A portfolio showcasing successful applications of generative models in NLP projects, including examples of utilizing OpenAI APIs for prompt engineering.

Preferred qualifications, capabilities, and skills
• Familiarity with the financial services industries.
• Expertise in designing and implementing pipelines using Retrieval-Augmented Generation (RAG).
• Hands-on knowledge of Chain-of-Thoughts, Tree-of-Thoughts, Graph-of-Thoughts prompting strategies.

Skills: Digital : Python~Digital : Artificial Intelligence(AI)~AI & Gen AI - Products & Tools
Experience Required: 8-10, Project Code :