Samiksha - Behind the Scenes

Chakshu Gautam

Chakshu Gautam

Technology @Samagra

This is a multipart behind-the-scenes series on how we built Samiksha and how this led to the creation of abstracted reusable technology components that we have today.

1. Problem Statement

Samiksha is an Open Source product designed to improve school quality control activities conducted by state education department officials in government schools. This typically includes mentoring to support teachers based on observations of their in-class teaching practices, and monitoring to inspect the status of infrastructure and quality of school processes such as classroom teaching, availability of teaching learning materials, functional toilets, completion of civil works projects in the school and so on.

Translating this to tech, it is essentially an ecosystem that gives

  • The administration console to build and deploy forms dynamically without updating the app. This also gives you the ability to manage roles, resources and permissions for individual users. This initial work essentially evolved into the Application console.

  • The school mentors and monitors an application to fill forms in a secure way. The is what later became part of the Mobile Application

  • the program owner access to all the raw data on which he can perform analytics. This analytics and visualization part is the Data Analytics package that we have right now.

2. Design Principles

Our approach to building software is heavily influenced by the need to empower government engineering teams with technology products built using Open Source tools that they could independently manage in the long term thereby reducing public expenditure, having confident deployments at scale, with any technology, infrastructure or vendor lock-ins. This is why

  • Principle #1 - Build reusable, scalable, extensible & deployable technology products using Open Source tools
  • Principle #2 - Work with existing government systems, don’t replace them
  • Principle #3 - Strengthen government technology capability, don’t replace it

I will be focussing on Principle #1 here. More importantly, how we chose the exact same Open Source projects to power our program.

3. Navigating in the Open Source World

Exploring what is already there

When we set out to build an open source application, we knew that there are four major components to it. The four major components for Samiksha are the following.

  1. The Form management module

  2. Authentication and Authorization server

  3. Aplication Control Module to manage and use data

  4. Data Analytics

Architecture This is how they connect in the overall structure.

The challenge with a lot of Open Source projects is not the quality, but interoperatability. Even when you are able to find something that is in a language that you know, you are not able to figure out how to make customizations to the code due to bad (and mostly no) documentation on the code. Often it is more a reflection on what skillset and frameworks your team knows and how cutting edge you want (can afford) to be.

Selecting Open Source projects is a science in its own right. With so many choices, it is inevitable that you would end up with decision fatigue.

Chosing an open source project

What it is like when you are selecting an Open Source project

If you just go by the number of Open Source solutions you can build, it would be 20^4 (assuming 20 legit candidates for each). So how do you choose one out of 1.6 lakh choices that you have? How do you narrow down choices?

Defining the criteria

Chosing an open source project

Why create a new new standard than using something already there?

Even before setting out to select the project, having a concrete idea of what we were looking for in the first place was essential. We listed down the requirements that were essential for each of the components and started mapping them to Open Source projects. But this was not enough, since a lot of projects were having similar features, so evolved the criteria to include some other parameters to make it more robust so that even a Github bot could check this. Since we were all willing to learn something new (even a new language), the criteria (in priority) apart from the core features that we were looking for was as follows

Must have

  • Active, funded community with frequent (at least a monthly) release - Since we are not a big team, contributing on something straight away making multiple PRs was not something that we could have done. Having an active community leverages their expertise and helps in getting solutions others have already custom built for themselves. Having a huge stackoverflow repository of questions, in a similar way keeps us agile.

  • Having a fully featured GUI - Debugging based on logs was not something that we could afford. Also having a GUI implemented, gives an example to how to implement the APIs in other use cases.

  • Code quality and maturity of the project - Anything less than an year was not considered.

  • An open licence - Open licences like MIT are favourable than others.

  • Docker based deployment - Deployment being one our design principles, anything without a container was a no go.

Good to have

  • Issues and PRs raised - PRs and issues getting closed at a fast rate, was a metric that becomes a proxy of how fast our own issues will be resolved.

  • Continuous integration status - CI/CD enables more frequent code deployments and helps check for bad code commits. All pushes to master should go through a pipeline. An Open Source project without one feels unsafe.

  • A CONTRIBUTING doc - Since we would be modifying a lot of what was already built, it was necessary to have a starting doc that would enable newcomers like us to ramp up faster.

  • Tests - Tests are essentially an indicator of robustness of the system. But not a deal breaker, they can be added later.

  • Benchmarking - For a low feature phone taking 100mb for just one app won't fly. Having benchmarks ensures the project is on the path of continuous improvement.

  • Code styling - This causes troubles for the contributors, 'cause one prefers spaces and another prefers tabs'

Though stargazers is something that people generally look for as a proxy for all of the above, we went a step ahead since we won't be able to switch to anything else for a long time. We would also be modifying some of the features heavily so all of the above things were essential for us.

The result?

When we were done evaluating everything, we were left with these four choices.

Component/ModuleChoice 1Choice 2Choice 3
1Form managementODK
2Identity ServerFusionAuthGluuOpenAM
3Application ConsoleReactAdmin
4Analytics and VisualizationMetabaseSuperset

Now since we had just 6 choices, we narrowed down to one based on the stack that we felt most comfortable with. React being one of them, Java being the other. So the final choices were, FusionAuth for Identity Server and Metabase for Visualization.

Now that we had made a bet on the above platforms, the next step was to figure out how to seamlessly integrate all these technologies such that everything is secure and ready to go. But we will take this up in the next post.

References

  1. https://www.freecodecamp.org/news/why-im-not-using-your-github-repository-2dff6c7ac7cf/
  2. https://itnext.io/what-i-dont-like-in-your-repo-a602577a526b
  3. https://towardsdatascience.com/how-to-choose-the-best-open-source-software-b1cbbe4f6398