【JAPAN AI】AI Quality Scientist / English
- Annual salary
-
8,000,000 to 16,000,000 yen
- Work location
-
Tokyo
- Job description
-
As an AI Quality Scientist at JAPAN AI, you will lead research and implementation of AI agent quality evaluation. Key responsibilities include:
1. Evaluation Metric Research & Development
- Research LLM-as-Judge calibration methods
- Design and validate evaluation benchmarks
- Research reward modeling and preference learning
- Select and design evaluation metrics
- Design and maintain evaluation datasets
2. Automated Evaluation Pipeline Design & Development
- Implement scalable automated evaluation pipelines
- Integrate evaluation pipelines into CI/CD
- Design agent evaluation harnesses
- Ensure reproducibility of evaluation processes
3. Safety & Quality Verification
- Implement automated red teaming
- Build safety and policy compliance verification frameworks
- Research hallucination detection methods
- Design and execute prompt/tool regression tests
4. Statistical Analysis & Experimental Design
- Design and analyze statistical experiments
- Visualize quality trends
- Automate regression detection
- Create quality improvement proposals
- Feed evaluation signals back to R&D teams
The role focuses on establishing "AI Evaluation Science" within the context of Japanese enterprise AI, scientifically guaranteeing the quality of AI products used by approximately 200 companies.
- Company name
-
株式会社ジーニー (Geniee, Inc.)
- Head office location
-
Sumitomo Fudosan Shinjuku Oak Tower 5F/6F, 6-8-1 Nishi-Shinjuku, Shinjuku-ku, Tokyo
- Employment type
-
Full-time permanent employee
- Insurance
-
Health insurance, employment insurance, employees' pension insurance, workers' accident compensation insurance
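- Holidays and leave
-
Full two-day weekend system. Scheduled days off: Saturdays, Sundays, and national holidays. Leave: annual paid leave, summer leave (3 days), year-end and New Year leave (December 31 to January 3), congratulatory and condolence leave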
- Last updated
-
2026/04/13