PhD position in "Consensus algorithms for distributed storage systems" - MSCA Cofund SEED programme

Updated: 3 months ago
Location: Nantes, PAYS DE LA LOIRE
Job Type: FullTime
Deadline: 14 Feb 2024

2 Feb 2024
Job Information
Organisation/Company

IMT Atlantique
Department

Doctoral division
Research Field

Engineering
Researcher Profile

First Stage Researcher (R1)
Country

France
Application Deadline

14 Feb 2024 - 12:00 (Europe/Paris)
Type of Contract

Temporary
Job Status

Full-time
Hours Per Week

37
Offer Starting Date

1 Sep 2024
Is the job funded through the EU Research Framework Programme?

HE / MSCA COFUND
Marie Curie Grant Agreement Number

101126640
Is the Job related to staff position within a Research Infrastructure?

No

Offer Description
Offer Description
The PhD position is offered under a standard track (30 months at IMT Atlantique + 3 months at the SCI institute of the University of Utah + 3 months at Deuxfleurs, France.)
1.1. Domain and scientific/technical context

Low latencies connections and decentralized servers are currently showcasing a new potential for distributed computing. Particularly, mobility and intermittent connectivity of computing resources create a need for distributed storage mechanisms resilient to network isolation.

However, developing integrated systems that are capable of exploiting highly distributed resources requires developers and service providers to deal with the unreliability of the compute nodes and of the network infrastructure, and must be considered during the design phase of systems. Additionnally, more pressing constraints on energy and resource consumptions will foster the need for distributed computation with restricted capabilities, for example relying on small server nodes that are turned off or disconnected most of the time.


1.2. Scientific/technical challenges

Coordination and consensus problems are at the core of distributed algorithms. In the context of server-side infrastructure and especially highly-distributed storage systems, we identify two main contributions as part of this topic proposal :

  • Leaderless consensus for server-side software: many distributed algorithms deployed today rely on strong coordination and leader elections: this is a costly approach that is not compatible with unreliable compute nodes and network. In contrast, weak coordination approaches and leaderless consensus are appealing for this situation. However, they have mostly been applied in the context of client-side local-first applications. We plan to extend this work to develop better lightweight server-side distributed software with faster access time and lightweight resource consumption even in situations that make coordination challenging.
  • CRDTs for storage and cluster systems: when an even weaker form of coordination is tolerable, conflict-free replicated data types (CRDT) [3] provide good system support for intermittent connectivity. CRDTs are used in synchronisation schemes, as replicas can be updated independently and concurrently without direct coordination [4,5]. We plan to investigate and formalize the use of CRDTs in storage systems and virtual clusters.

  • 1.3. Considered methods, targeted results and impacts

    The main motivating use case for this work is the Garage software, an open-source distributed object storage service tailored for highly-distributed infrastructures [6]. Garage is developped by Deuxfleurs.

    Garage already uses CRDTs to tolerate network disconnections, but would sometimes require a stronger coordination model. For example, Garage currently allows two users to create conflicting storage spaces on two different nodes, and the conflict is only discovered when the changes are propagated. A lightweight "leaderless consensus" approach would provide more guarantees, while being less sensitive to latency and network disconnections compared to Paxos or Raft.

    Another interesting problem in Garage is : how to maintain the consistency of data replicas when storage nodes are added or removed? Any membership change may cause an update of the location of replicas, but it will take time to actually move the data. During this time, all nodes need to maintain the desired level of consistency while accounting for both the old and new location. This problem has not been thoroughly studied for the read-after- write consistency model used in Garage.

    The proposed topic is expected to contribute to the algorithmic state-of-the-art around distributed storage systems, which would indirectly benefit all such systems. Another goal of the thesis is to implement the proposed solutions in the Garage software itself.

    This work can also be applied to virtual clusters (Namespaces) in Kubernetes-like software stacks. Namespaces presents services, and deployments users use to build and run their applications. The ability to efficiently deploy a virtual cluster over geographically distributed resources could enable collaboration between containers by exposing a resource created on one site to another one with minimal code changes.


    2. Partners and study periods
    2.1. Supervisors and study periods
    • IMT Atlantique: Dr. Daniel Balouek , Inria research scientist, IMT Atlantique, Nantes, France

      The PhD student will stay 30 months at Dr. Balouek's lab.

    • International partner: The PhD student will probably be hosted three months at the SCI institute of the University of Utah. However, this partner may still change.
    • Industrial partner(s): Dr. Alex Auvolat , research engineer, DeuxFleurs

      The PhD student will be hosted three months at Deuxfleurs.


    2.2. Hosting organizations
    2.2.1. IMT Atlantique

    IMT Atlantique , internationally recognized for the quality of its research, is a leading French technological university under the supervision of the Ministry of Industry and Digital Technology. IMT Atlantique maintains privileged relationships with major national and international industrial partners, as well as with a dense network of SMEs, start-ups, and innovation networks. With 290 permanent staff, 2,200 students, including 300 doctoral students, IMT Atlantique produces 1,000 publications each year and raises 18€ million in research funds.


    2.2.2. Deuxfleurs

    Deuxfleurs  is a French association that is working towards changing the Internet experience towards a convivial Internet.


    Requirements
    Research Field
    Engineering
    Education Level
    Master Degree or equivalent

    Skills/Qualifications

    The topic involves several fields: distributed algorithmics, implementation of these algorithms in an existing software (Garage), and applications to real-life large-scale distributed systems. It will also involve large-scale experiments using research platforms such as SLICES-RI.


    Languages
    ENGLISH
    Level
    Excellent

    Research Field
    Engineering

    Additional Information
    Benefits
    A PhD programme of high quality training : 4 reasons to apply
    • SEED is a programme of excellence that is aware of its responsibilities: to provide a programme of high quality training to develop conscientious researchers, including training in responsible research and ethics. 
    • SEED’s unique approach of providing interdisciplinary, international and cross-sector experience is tailored to work in a career-focused manner to enhance employability and market integration.
    • SEED offers a competitive funding scheme, aiming for an average monthly salary of EUR 2,000 net per ESR, topped by additional mobility allowances as well as optional family allowances.
    • SEED is a forward-looking programme that actively engages with current issues and challenges, providing research opportunities addressing industrial and academic relevant themes.

    Eligibility criteria

    Eligibility criteria. In accordance with MSCA rules, SEED will open to applicants without any conditions of nationality nor age criteria. SEED applies the MSCA mobility standards and necessary background. Eligible candidates must fulfil the following criteria

    • Mobility rule: Candidates must show transnational mobility by having not resided or carried out their main activity (work, studies, etc.) in France for more than 12 months in the three years immediately before the deadline of the co-funded program's call (Jan 31, 2024 for Call#1). Compulsory national service, short stays such as holidays and time spent as part of a procedure for obtaining refugee status under the Geneva Convention are not taken into account.
    • Early-stage researchers (ESR): Candidates must have a master’s degree or an equivalent diploma at the time of their enrolment and must be in the first four years (full-time equivalent research experience) of their research career. Moreover, they must not have been awarded a doctoral degree.
      Extensions may be granted (under certain conditions) for maternity leave, paternity leave, as well as long-term illness or national service.

    Selection process

    The selection process is described on the guide for applicants available here: https://www.imt-atlantique.fr/en/research-innovation/phd/seed/documents


    Additional comments

    Applications can only be provided through the application system available under the SEED website: https://www.imt-atlantique.fr/seed


    Website for additional job details

    https://www.imt-atlantique.fr/en/research-innovation/phd/seed

    Work Location(s)
    Number of offers available
    1
    Company/Institute
    IMT Atlantique
    Country
    France
    City
    Nantes
    Street
    4, rue Alfred Kastler - La Chantrerie
    Geofield


    Where to apply
    Website

    https://www.imt-atlantique.fr/en/research-innovation/phd/seed

    Contact
    City

    Nantes
    Website

    https://www.imt-atlantique.fr/en/research-innovation/phd/seed
    Street

    4, rue Alfred Kastler
    E-Mail

    [email protected]

    STATUS: EXPIRED

    Similar Positions