The Wayback Machine - https://web.archive.org/web/20151025040137/https://archive.org/about/jobs.php
Universal Access To All Knowledge
Home Donate | Store | Blog | FAQ | Jobs | Volunteer Positions | Contact | Bios | Forums | Projects | Terms, Privacy, & Copyright
Anonymous User (login or join us)
Upload

Job Opportunities at the Internet Archive

  • Senior Engineer: Wayback Machine
  • Web Application Developer (Books)
  • Software Engineer: Cluster Storage & Computing
  • Senior Python Engineer
  • Software Engineer: Cluster Storage & Computing (Remote)
  • Senior Engineer: Wayback Machine

    About the Internet Archive: Internet Archive is a 501(c)(3) non-profit library founded in 1996. Our motto is Universal Access to All Knowledge. We collect web sites, books, audio, videos, software, and other types of media and make them available to the world for free. Based in San Francisco and with satellites around the world, the Internet Archive staffers are building the digital library of the future--a place where we can all go to learn and explore. Our 140 engineers, book scanners, librarians, designers and team members have built the #250 website in the world, https://archive.org.

    Location: San Francisco, CA

    Job Classification: Full-time, exempt

    Job Summary: The Internet Archive's Wayback Machine is the world's largest public archive of historical web sites. Have you ever wanted to work with 450 billion things at once? Would you like to serve 1,500 requests per second? How about having your service referred to regularly in news articles and blog posts across the web? You can work on a challenging and popular project and help the world at the same time.

    We are looking for a smart, collaborative and resourceful engineer to help develop the next version of the Wayback Machine. The ideal candidate will possess a desire to work collaboratively with a small internal team and a large, vocal and active user community; demonstrating independence, creativity, initiative and technological savvy, in addition to being a great programmer/architect.

    Minimum Qualifications:

    • 2-3 years work experience in Python, or similar
    • Experience working in Linux environments
    • Familiarity with Java (current deployment is written in Java)
    • Good understanding of latest web framework technologies and aspects of web technology and protocols
    • Flexibility and a sense of humor
    • BS Computer Science, or equivalent work experience

    Preferred Qualifications:

    • Experience with web crawlers and/or applications designed to display archived web content (especially server-side apps)
    • Cluster computing experience
    • Open source practices experience

    To apply: Please send your resume and cover letter to Jobs+SeniorWaybackEngineer@archive.org with the subject line "Wayback Machine Senior Engineer."

    Internet Archive is an Equal Opportunity Employer. Internet Archive complies with the Fair Chance Ordinance.

    Internet Archive is an Equal Opportunity Employer. Internet Archive complies with the Fair Chance Ordinance.

    Web Application Developer (Books)

    About the Internet Archive: The Internet Archive is a non-profit with a huge mission: to give everyone access to all knowledge - the books, web pages, audio, television and software of our shared human culture. Forever. Based in San Francisco and with satellites around the world, the Internet Archive staffers are building the digital library of the future - a place where anyone can go to learn and explore. Our 140 engineers, book scanners, librarians, designers and team members have built one of the top websites in the world, https://archive.org.

    Location: San Francisco, CA

    Job Classification: Full-time, exempt

    Job Summary: The Internet Archive has a huge corpus of digital information. Every day, our team of development engineers creates tools and applications to work with 22 petabytes of content that includes millions of books and texts, millions of hours of video, millions of audio tracks, and over 450 billion web captures. We are looking for a smart engineer to help develop next generation of web-based applications used by libraries and archives around the world to build and manage archives of books, texts, and image content. The ideal candidate will be a strong programmer who has successfully completed several projects involving large or intricate web applications or services, and will work collaboratively with talented engineering colleagues.

    Key Responsibilities: The responsibilities of this position are to develop and evolve the internal tools that support the review, editing and re-publishing of books and texts that have ben scanned into digital formats. More specifically, this means:
      • Work at the direction of the technical project leader to create the next generation toolset for editing and assembling digital texts.
      • Contribute to the documentation, training, maintenance, and evolution of this and other tools.
      Minimum Qualifications:
      • 2-3 years work experience with Javascript, HTLM5, and CSS.
      • Experience working with image file data structures.
      • Experience with building or maintaining APIs.
      • Good understanding of latest web framework technologies and aspects of web technology and protocols.
      • Fluency in Linux environments
      • Flexibility and a sense of humor

      Preferred Qualifications:

      • Experience with scanning and digital book formats (PDF, epub, AZW, etc.).
      • Strong programming experience in PHP, Python, or both.
      • Experience working with iiif image presentation framework.
      • Experience open source practices and participation in open source forums
      • Specific experience with Atlassian tool sets (Jira, Confluence)

      The Web Application Developer reports to the Director of Engineering and will work closely with the digital book teams.The entire staff is guided by founder and Digital Librarian, Brewster Kahle. Like many Bay Area technology firms, we offer staffers a catered lunch every Friday, massages, espresso bar, endless M&Ms; and working in the coolest office space in SF. But more importantly, you will be helping to make sure we bring out cultural treasures into the digital age, while preserving our present and future. Read more about the Internet Archive in Medium and The New Yorker

      To Apply: Please send your resume and cover letter to Jobs+WebAppDeveloper@archive.org with the subject line "AE-104: Web Application Developer"

      Internet Archive is an Equal Opportunity Employer. Internet Archive complies with the Fair Chance Ordinance. Internet Archive is a 501(c)(3) non-profit library founded in 1996.

      Software Engineer: Cluster Storage & Computing

      About the Internet Archive: The Internet Archive is a non-profit with a huge mission: to give everyone access to all knowledge - the books, web pages, audio, television and software of our shared human culture. Forever. Based in San Francisco and with satellites around the world, the Internet Archive staffers are building the digital library of the future--a place where anyone can go to learn and explore. Our 140 engineers, book scanners, librarians, designers and team members have built one of the top websites in the world: https://archive.org.

      Location: San Francisco, CA

      Job Classification: Full-time, exempt

      Job Summary: The Internet Archive has a huge corpus of digital information...all running across and integrated cluster of over 700 VMs. Every day, our team of development engineers creates tools and applications to work with 20 petabytes of content replicated over 3 data centers (50 petabytes raw storage total). This includes millions of books and texts, millions of hours of video, millions of audio tracks, and over 450 billion web captures. We are looking for a smart engineer with deep experience in "back-end" systems development and a passion for modern and evolving technologies and approaches to compute/data cluster operations. The ideal candidate will be a quick learner and have demonstrated thoughtful coding technique with good consideration to ensuring the reliability and performance of large/complicated web applications or services. They will also be excited to work collaboratively with talented engineering colleagues.

      Key Responsibilities: This position will work with the Principal Cluster Operations Engineer to develop, extend, and maintain tools and applications that add, verify, and modify the content of our storage cluster. This includes:

      • Create, update, and extend tools and applications that support the flow of content within the storage cluster
      • Monitor and tune job processing queues to ensure efficient and complete execution of long-running processes
      • Define, communicate, and enforce the processes to ensure the integrity and preservation of data
      • Analyze current challenges in our development and operational process and infrastructure(s) and propose, validate, and implement effective improvements
      • Participate in monitoring, maintain and (when necessary) restoring the health of the storage and compute cluster

      Minimum Qualifications:

      • Demonstrated experience creating production code in PHP or Python (and ideally both)
      • Fluency in Linux environments
      • Scripting and/or programming skills and the ability to develop custom tool integrations
      • Interest in coding for cluster-computing environments
      • Familiarity with software development tools (such as Git-Hub) and monitoring tools (Nagios, Graphite, etc.)
      • Takes pride in keeping current with industry discoveries and evolutions
      • Basic understanding of cluster-centric networking (UDP and TCP/IP protocols)
      • Flexibility and a sense of humor

      Preferred Qualifications:

      • Specific experience with Jira
      • Experience open source practices and participation in open source forums
      • Understanding of the functions of load balancers, web and DNS servers, etc. Able to troubleshoot networking and web services issues using nginx and Apache logs as well as network packet captures
      • Familiarity with database and search technologies, including Redis, Postgres, ElasticSearch, and SOLR
      • Working knowledge of JavaScript
      • BS in Computer Science or equivalent work experience
      • Habitually explore new tools and ideas and incorporate them into your everyday work

      Reporting Structure: Software Engineering positions report to the Director of Engineering and will work closely with both the cluster infrastructure team and web application development teams. The entire staff is guided by founder and Digital Librarian, Brewster Kahle. Like many Bay Area technology firms, we offer staffers a catered lunch every Friday, massages, Blue Bottle Coffee, endless M&Ms; and working in the coolest office space in SF. But more importantly, you will be helping to make sure we bring out cultural treasures into the digital age, while preserving our present and future. Read more about the Internet Archive in Medium and The New Yorker

      To Apply: Please send your resume and cover letter to jobs+ClusterStorageComputing@archive.org with the subject line "CO-201: Software Engineer: Cluster Storage & Computing"

      Internet Archive is an Equal Opportunity Employer. Internet Archive complies with the Fair Chance Ordinance.

      CRM Application Specialist

      About the Internet Archive: The Internet Archive is a non-profit with a huge mission: to give everyone access to all knowledge - the books, web pages, audio, television and software of our shared human culture. Based in San Francisco and with satellites around the world, the Internet Archive staffers are building the digital library of the future--a place where anyone can go to learn and explore. Our 150 engineers, book scanners, librarians, designers and team members have built one of the top websites in the world https://archive.org.

      Location: San Francisco, CA

      Job Classification: Full-time, exempt

      Job Summary: The Internet Archive has a huge corpus of digital information. This includes millions of books and texts, millions of hours of video, millions of audio tracks, and over 450 billion web captures. We are looking for a smart developer with experience in web development and administrative systems (specifically CiviCRM). The ideal candidate will be experienced in CiviCRM, comfortable with manipulating data, a quick learner and have at least a basic familiarity with coding. They will also be excited to work collaboratively with our talented administrative team and engineering colleagues.

      Key Responsibilities:

      This position will work with the administrative team, the development teams, and external consultants to implement CiviCRM and supporting web development for Internet Archive donation programs. Additionally, this individual may work to integrate, extend, and maintain administrative tools and applications and services as requested throughout Internet Archive. This includes:

      • Serve as the primary technical contact for CiviCRM activities and requests
      • Accurately translate requirements to support Donor activities (communicated from the administrative team) into appropriate configuration of CiviCRM
      • As necessary, work to integrate CiviCRM with other data feeds and support tools
      • Implement user-facing web pages or (html email) to support donor communication and contribution
      • Work on engineering projects as directed

      Minimum Qualifications:

      • CiviCRM experience or experience configuring complex web applications for production use
      • Experienced and comfortable with "data wrangling"
      • Comfort in Linux environments
      • Scripting and basic programming skills and the ability to develop custom tool integrations
      • Comfortable using electronic means of communication with others (instant messaging, email, etc.)
      • Takes pride in keeping current with industry evolution
      • Flexibility and a sense of humor

      Preferred Qualifications:

      • Specific experience with production implementation of CiviCRM systems
      • Specific experience with Jira
      • Demonstrated capability to program in Python
      • Web development experience (using Javascript and other web site technologies)
      • Experience working with ERP systems
      • Experience open source practices and participation in open source forums
      • Habitually explore new tools and ideas and incorporate them into your everyday work

      Reporting Structure: Development positions report to the Director of Engineering. The entire staff is guided by founder and Digital Librarian, Brewster Kahle. Like many Bay Area technology firms, we offer staffers a catered lunch every Friday, massages, Blue Bottle Coffee, endless M&Ms; and working in the coolest office space in SF. But more importantly, you will be helping to make sure we bring out cultural treasures into the digital age, while preserving our present and future. Read more about the Internet Archive in Medium and The New Yorker.

      To Apply: Please send your resume and cover letter to jobs+AdminWebApp@archive.org with the subject line "D-104: Developer, Administrative Applications"

      Internet Archive is an Equal Opportunity Employer. Internet Archive complies with the Fair Chance Ordinance. Internet Archive is a 501(c)(3) non-profit library founded in 1996.

      Senior Python Engineer

      Location: Inner Richmond, San Francisco, CA

      Job Classification: Full-time, exempt

      Job Summary: The Internet Archive is one of the world's largest public archive of historical web sites and digital information. This includes millions of books and texts, millions of hours of video, millions of audio tracks, and over 450 billion web captures. We are looking for smart engineers with deep experience in web application development and a passion for modern technologies and approaches software development. The ideal candidate will be a quick learner and have demonstrated thoughtful coding technique with good consideration to ensuring the reliability and performance of large web applications or services. They will also be excited to work collaboratively with talented engineering colleagues. You can work on a challenging and popular project and help the world at the same time.

      Key Responsibilities: The duties of this position are to develop software applications that support the Digital Books operations. This will entail coding (mostly in Python and JavaScript), system analysis, application design, and other technical duties.

      Minimum Qualifications:

      • 2-3 years extensive work experience in Python
      • 2-3 years work experience with JavaScript
      • Experience working in Linux environments and GitHub.
      • Good understanding of latest web framework technologies and aspects of web technology and protocols
      • Flexibility and a sense of humor
      • BS Computer Science, or equivalent work experience

      Preferred Qualifications:

      • Experience building web applications that access and update large digital collections (as of ebook, digital recordings, etc.
      • Experience in Open source practices.

      Reporting Structure: Senior Python Developer report to the Director of Engineering and will work closely with the Books team to define, detail and conceptualize book applications.

      To apply: Please send your resume and cover letter to mailto:Jobs+SeniorPythonEngineer@archive.org with the subject line "AE-304: Senior Python Developer."

      Internet Archive will consider for employment-qualified applicants with criminal histories in a manner consistent with the requirements of the Fair Chance Ordinance.

      About the Internet Archive: The Internet Archive is a non-profit with a huge mission: to give everyone access to all knowledge—the books, web pages, audio, television and software of our shared human culture. Forever. Based in San Francisco and with satellites around the world, the Internet Archive staffers are building the digital library of the future--a place where we can all go to learn and explore. Our 140 engineers, book scanners, librarians, designers and team members have built the #200 website in the world. (https://archive.org). Like other Bay Area technology firms, we offer staffers a catered lunch every Friday, massages, Blue Bottle Coffee, endless M&Ms; and working in the coolest office space in SF. But more importantly, you will be helping to make sure we preserve the history of the Digital Age.

      Software Engineer: Cluster Storage & Computing (Remote)

      A unique opportunity to join the team of the Internet Archive as a remote Web Application Developer located in the Columbus, OH area

      About the Internet Archive: The Internet Archive is a non-profit with a huge mission: to provide universal access to all knowledge—the books, web pages, audio, television and software of our shared human culture. Based in San Francisco and with satellites around the world, the Internet Archive is collaborating with library partners worldwide to build the digital library of the future--a place where anyone can go to learn and explore. Our 140 engineers, book scanners, librarians, designers and other team members have built one of the top websites in the world. Check us out at www.archive.org.

      Software Engineer: Cluster Storage & Computing: The Internet Archive is currently seeking a remote Cluster Storage & Computing Engineer with deep experience in “back-end” systems development, and a passion for modern and evolving technologies in compute/data cluster operations. The ideal candidate will be a quick learner and have demonstrated thoughtful coding technique ensuring the reliability and performance of large/complicated web applications or services. They will also be excited to work collaboratively with talented engineering colleagues.

      Location: Columbus, OH Job Classification:Full-time, remote

      Job Purpose and Responsibilities: The Cluster Storage & Computing Engineer works from home in the Columbus area, with a distributed team of talented developers, and makes occasional trips to San Francisco headquarters as needed. He or she will work with the Principal Cluster Operations Engineer to develop, extend, and maintain tools and applications that add, verify, and modify the content of our storage cluster. This includes:

      • Creating, updating and extending tools and applications that support the flow of content within the storage cluster
      • Monitoring and tuning job processing queues to ensure efficient and complete execution of long-running processes
      • Defining, communicating, and enforcing processes to ensure the integrity and preservation of data
      • Analyzing current challenges in our development and operational process, and proposing, validating, and implementing effective improvements.
      • Participating in monitoring, maintaining and (when necessary) restoring the health of the storage and compute cluster

      Minimum Qualifications:

      • Demonstrated experience creating production code in PHP or Python (and ideally both)
      • Fluency in Linux environments
      • Scripting and/or programming skills and the ability to develop custom tool integrations
      • Interest in coding for cluster-computing environments
      • Familiarity with software development tools (such as Git-Hub) and monitoring tools (Nagios, Graphite, etc.)
      • Takes pride in keeping current with industry discoveries and evolutions
      • Basic understanding of cluster-centric networking (UDP and TCP/IP protocols).
      • Flexibility and a sense of humor

      Preferred Qualifications:

      • Specific experience with Jira
      • Experience open source practices and participation in open source forums
      • Understanding of the functions of load balancers, web and DNS servers, etc. Able to troubleshoot networking and web services issues using nginx and Apache logs as well as network packet captures.
      • Familiarity with database and search technologies, including Redis, Postgres, ElasticSearch, and SOLR
      • Working knowledge of JavaScript
      • BS in Computer Science or equivalent work experience
      • Habitually explore new tools and ideas and incorporate them into your everyday work
      Thanks for your interest! Qualified only, please. We look forward to hearing from you.


Terms of Use (31 Dec 2014)