
Search by job, company or skills
Showing 2 jobs
Skills:
Ml, Jax, Pytorch, Python, RLAIF, SFT, DPO, distributed training, Ai, ppo, preference data curation, reward modeling, large language models, synthetic data generation, RLHF
Skills:
Java, Machine Learning, C, Artificial Intelligence, Javascript, Python, Agent-to-Agent, Distributed Systems Design, Generative AI, Model Context Protocol, Agent Development Kit, Go, Multi-Modal Models, Low Level Systems Programming, Large Language Models
