Job Requirements
Remote San Antonio, TX
Clearance: Top Secret
Polygraph: not specified
Career Level: not specified
$140,000 - $180,000
Job Description
Big Data Database Engineer (Accumulo Specialist)
Location: Hybrid (San Antonio, TX area)
Work Model: Onsite with initial remote flexibility (2–3 month ramp period)
Overview
An enterprise federal program is seeking a highly specialized Big Data Database Engineer with deep expertise in Apache Accumulo to support a mission-critical distributed data platform. This role is focused on the sustainment, optimization, security, and modernization of large-scale data environments operating across multiple classification levels.
This is a hands-on engineering role that requires true subject matter expertise in Accumulo and its supporting ecosystem; it is not a general database administrator position.
Key Responsibilities
Accumulo Engineering & Administration
Administer and optimize distributed Accumulo clusters, including core services and system components
Perform advanced tuning of compactions, tablet management, and table-level configurations
Design and maintain table structures, iterators, partitioning strategies, and performance settings
Troubleshoot complex system-level issues including iterator failures, class loading issues, and runtime errors
Distributed Systems Management
Configure and maintain Hadoop ecosystem components including HDFS, ZooKeeper, and YARN
Optimize distributed storage and compute performance for high-volume ingestion and analytics workloads
Ensure proper integration and stability between platform components
Data Security & Access Control
Implement and enforce fine-grained data security controls, including cell-level access restrictions
Ensure compliance with multi-classification data handling requirements
Support secure data flows across enterprise environments
Backup, Recovery & Sustainment
Manage backup and recovery strategies across distributed data platforms
Support operations across multiple network environments with differing classification levels
Maintain system reliability and support ongoing modernization efforts
System Diagnostics & Optimization
Analyze logs and performance metrics to identify and resolve ingest bottlenecks and system constraints
Diagnose dependency issues across distributed services
Partner with engineering teams to improve system performance and scalability
Required Qualifications
Active TS/SCI clearance or TS with SCI eligibility
Proven, hands-on experience engineering and administering Apache Accumulo in enterprise environments (required)
Strong experience with distributed systems, including HDFS, ZooKeeper, and YARN
Background in Hadoop ecosystem configuration, troubleshooting, and performance tuning
Proficiency with Python and/or Java for system interaction and debugging
Ability to troubleshoot deep technical issues within distributed data platforms
Preferred Qualifications
Experience supporting large-scale data platforms for analytics, ingestion, or cyber-related use cases
Familiarity with mission-driven or highly secure data environments
Experience developing automation for database management, monitoring, and maintenance workflows
Exposure to Java-based application stacks that integrate with distributed data stores
We offer roles across all three clearance levels: Confidential, Secret and Top Secret. With a Top Secret Facilities clearance, a proven subcontractor track record and a deep understanding of agencies across Defense, Intelligence, Homeland, Justice and Federal Civilian Sectors, Kforce brings more than 20 years of experience to supporting critical missions at federal, state and local levels.