AI Platform Engineer

Remote
Full Time
Experienced
ERP Suites develops Enterprise AI agents and orchestration solutions hosted on Oracle Cloud Infrastructure (OCI). Our products automate complex finance, supply chain, and operational workflows for enterprise customers.
We are seeking an AI Platform Engineer who thrives in a high-ownership environment and is passionate about building, operating, and scaling the infrastructure that powers next-generation AI solutions.
Location:
Our home office is based in Cincinnati, OH. However, we are open to hiring someone who is fully remote regardless of location. Although the position will be remote, there might be some occasional travel to ERP Suites facilities or customer sites.

Position Summary:
The AI Platform Engineer will be responsible for the platform foundation that supports ERP Suites' AI products and customer environments. This role partners closely with the AI Architect and Product Team to deploy, manage, secure, and optimize Oracle Cloud Infrastructure environments, automation pipelines, and AI agent deployment frameworks.
The ideal candidate is a hands-on cloud and infrastructure professional with experience in DevOps, AI platform operations, automation, security, observability, and enterprise cloud architecture.
Daily Activities:
• Monitor OCI service health, logs, dashboards, and alerts.
• Troubleshoot platform issues and customer environment concerns.
• Support development teams with infrastructure questions and deployment needs.
• Manage CI/CD pipeline performance and resolve deployment failures.
• Execute provisioning, onboarding, and configuration requests.
• Update documentation and architecture artifacts as infrastructure evolves.
• Participate in standups, planning meetings, and technical reviews.

Monthly Activities:
• Review OCI consumption reports, billing dashboards, and cost optimization opportunities.
• Conduct IAM, security, and credential audits.
• Evaluate reference architecture environments for configuration drift and required updates.
• Refine deployment methodologies, runbooks, and onboarding documentation.
• Assess Oracle OCI roadmap updates and emerging platform capabilities.
• Contribute technical documentation, architecture guidance, and internal knowledge-sharing content.

Key Responsibilities:
Cloud Infrastructure & DevOps:
• Provision, configure, and manage Oracle Cloud Infrastructure (OCI) environments, including computer, networking, load balancers, API gateways, IAM, containers, and related services.
• Manage OCI Functions, Autonomous Database Serverless (ADB-S), and containerized deployment environments.
• Build, maintain, and optimize OCI DevOps pipelines, artifact repositories, and deployment automation.
• Support OCI Goldengate planning, configuration, and data replication architectures.
• Develop automation solutions that improve reliability, scalability, and operational efficiency.

AI Agent Deployment & Operations:
• Own customer-facing AI agent deployment methodologies, runbooks, environment configurations, and deployment standards.
• Coordinate customer environment provisioning, compartment creation, IAM setup, and onboarding activities.
• Manage AI agent environments across development, testing, and production stages.
• Support development teams through infrastructure reviews, deployment guidance, and technical troubleshooting.
• Maintain and extend ERP Suites' enterprise reference architectures and deployment frameworks.

Monitoring, Observability & FinOps:
• Build and maintain Grafana dashboards and reporting solutions for operational monitoring and customer billing.
• Develop ETL processes that aggregate OCI cost and consumption data.
• Monitor platform health, performance, reliability, and resource utilization.
• Diagnose and resolve observability gaps before they impact customer environments.
• Ensure accurate reporting and billing visibility across customer environments.

Security & Governance:
• Audit OCI IAM policies, Vault usage, credential management processes, and security controls.
• Maintain TLS certificate automation using ACME, Let's Encrypt, and OCI Load Balancer integrations.
• Support secure architecture reviews and infrastructure compliance initiatives.
• Ensure proper access controls, credential rotation, and security best practices across environments.

Technical Architecture & Documentation:
• Create and maintain architecture diagrams, infrastructure maps, deployment workflows, and technical documentation.
• Document automation scripts, deployment processes, and operational procedures.
• Participate in technical planning sessions with customers and internal stakeholders.
• Identify infrastructure risks and recommend scalable solutions.

Qualifications:
Required:
• Bachelor’s degree in computer science, Information Systems, Engineering, or a related field.
• 2+ years of experience in AI Platform Engineering, Infrastructure Engineering, MLOps, DevOps, or Cloud Engineering.
• Strong experience with Oracle Cloud Infrastructure (OCI), including:
• Experience deploying and supporting AI agents, microservices, or cloud-native applications.
• Experience with monitoring and observability platforms such as Grafana, LangFuse, OCI Logging, and Metrics APIs.
• Knowledge of TLS, DNS, ACME protocols, Let's Encrypt, and certificate automation.
• Experience with CI/CD tools, source control, deployment pipelines, and artifact management.
• Proficiency in Python, SQL, and Bash scripting.
• Strong technical writing, documentation, and architecture diagramming skills.
• Excellent communication and collaboration skills.

Core Competencies:
• Cloud infrastructure architecture and administration
• AI platform operations and deployment
• DevOps and CI/CD automation
• Monitoring, observability, and FinOps
• Security architecture and identity management
• Infrastructure-as-Code and automation
• Technical troubleshooting and root cause analysis
• Customer-facing technical consulting
• Documentation and knowledge transfer

Preferred Qualifications:
• Oracle Cloud certifications such as OCI Architect Professional or OCI DevOps Professional.
• Experience supporting multi-tenant SaaS or managed-service environments.
• Exposure to large language model (LLM) infrastructure and agentic AI frameworks such as LangChain, MCP, or similar technologies.
• Experience implementing AI observability platforms such as LangFuse, MLflow, or equivalent tools.
• Familiarity with JD Edwards EnterpriseOne, including CNC, AIS, Orchestrator Studio, or security administration.
• Experience participating in Oracle Partner Network, Oracle ACE, or similar technical communities.

Company:

At ERP Suites, our focus is on helping our customers realize IT’s potential. Our comprehensive ERP solutions enable them to streamline and scale their IT products and processes. And this leads directly to improved efficiency and increased margins. We are a proud Oracle Gold Partner and champion of proactive JD Edwards management and custom product enhancements.
ERP Suites provides technical consulting, cloud services, managed services, and digital transformation solutions for some of America’s top companies. We build secure connections, improve performance, automate workloads and give them mobility. In other words, we help them stay on top.
We deliver multi-functional value through cloud services, digital transformation, ERP consulting services, ERP managed services, and software development. 

Core Values:
  • Make Customers Successful
  • Be An Advisor
  • Be a teacher
  • Be a Coach
  • Have Fun
  • Do the Right Things for the ERP Suites Family
  • Adapt Quickly to Changing Roles and Environments
This is Where IT Change Starts.
  • With questions.
  • With problems that need to be solved.
  • With business needs, both immediate and long term.
  • Because technology and its impact on business isn’t getting any simpler.
  • That’s why we exist.
  • To answer the tough questions.
  • To find a solution to every problem—no matter the size or scope.
  • And to help companies not just identify IT’s potential, but realize IT



 
Share

Apply for this position

Required*
We've received your resume. Click here to update it.
Attach resume as .pdf, .doc, .docx, .odt, .txt, or .rtf (limit 5MB) or Paste resume

Paste your resume here or Attach resume file

Human Check*