<p>DomainTools is seeking a <strong>R&D Data Engineering Intern</strong>. This role is intended for those seeking to hone their development and data analysis skills as they begin their career. A successful candidate will be well organized, collaborative, and experienced in working with remote teams. </p><p>As our intern, you will be part of a critical team supporting production machine learning pipelines, providing development and ad-hoc support to our business. <strong>Responsibilities include</strong>: assisting in data hygiene projects and documentation to maintain data integrity, automating processes, and supporting the R&D team on other special projects as needed. These valuable opportunities provide hands-on experience, allowing you to put your educational knowledge into action and lay a solid foundation for your future. The role provides an excellent learning opportunity specifically for those interested in internet security, machine learning, and production development patterns.</p><p>The right person for us will have a support oriented mentality, wanting to enable organizations to make better business decisions and improve efficiency. </p><p></p><p><strong>Key Responsibilities:</strong></p><ul><li>Data cleaning and preparation to ensure our machine learning pipelines remain accurate and reliable. </li><li>Update and maintain code to ensure our system stays compatible with evolving data sources. </li><li>Develop and improve tools to monitor data health and help the team explore new ways to use our datasets. </li></ul><p><strong>Requirements</strong></p><p><strong>Qualifications & Requirements</strong></p><ul><li><strong>Strong Organizational Skills</strong>: Ability to manage your tasks and schedule effectively. </li><li><strong>A Strong Attention to Detail</strong>: A sharp eye for spotting inconsistencies in data and a commitment to high-quality documentation. </li><li><strong>Clear and Precise Communication Skills</strong>: The ability to share updates and collaborate effectively with a remote team. </li><li><strong>Development Skills</strong>: Familiarity with python, git. Ideally also familiar with Spark/PySpark.</li><li><strong>Preferred</strong>: Knowledge of computer networks, including DNS, domain names, and IP addresses</li></ul><p></p><p><strong>Time Commitment</strong></p><ul><li><strong>Hours Per Week</strong>: 5-15 hours, depending on availability </li><li><strong>Working Hours</strong>: Can be flexible but a regular check-ins are required during business hours, US/Eastern time</li></ul><p><strong>Benefits</strong></p><p>This is an <strong>unpaid internship</strong> offered for <strong>academic credit only</strong>; monetary compensation is not available. The primary goal is the intern's <strong>education and training</strong>, not to generate immediate advantage for the employer. The experience is designed to provide valuable, hands-on learning similar to an educational environment, and there is no guarantee of a paid position at the conclusion of the internship. The intern will work under close supervision of existing staff and will not displace regular employees. </p><p></p><p></p>