Student Network Optimization

Student talking in teams to connect with other students

Student Network Optimization focuses on connecting students to each other with different profiles. An art student could learn a different perspective on their work from a business lead and vice-versa. This project focused on providing a tool to give an indication for student collaboration. To expand their network and knowledge.

1. Background

Prior to starting any project, it is crucial to get good background information on the environment. The project started as an FHICT-Delta experiment. Delta is the excellence track for Fontys ICT students. In this track, students get the possibility to handle projects with almost full autonomy.  Lecturers are rarely involved with projects, only on request of a student and in situations where the students could fail the semester. This results in an environment where students explore their own skills and the context they want to work with. As a Delta student myself, I started to focus on data and organization processes. During my second semester as a Delta, I started to notices that there are clusters of students that often work together. The arts & media-, software and technology clusters are easy to pick out once you start to work on projects. As these clusters begin to form, they focus more and more on their issues and leave other perspectives on their work to the side. If not approached at an early moment, a division of knowledge and experience occurs within student groups will occur.
 
The goal of the project is to create a product that can connect students of different backgrounds to each other based on current collaborations.
Based on the information provided, the task of the project were as follows:
  • Obtain data,
  • Standardize the data,
  • Create metrics to quantify student network diversity,
  • Develop a dashboard to assist students.

2. Data obtainment

At the beginning of a semester, all Delta students get together to select the projects they want to work on. This is done by putting all the projects on a whiteboard and pitching the projects to other students. After the project has been pitched, students get to select the projects they want to work on by adding their name to the whiteboard. After the event, a Delta lecturer takes pictures of the whiteboard and puts their names into an Excel sheet. This Excel sheet is then published on the student course and is available to all Delta students.

The data is formatted with students on the X-axis and projects as columns. In essence, it is a pivoted table. On which the relation of a student to a project and vice-versa is modelled. For this project, a static version of this Excel sheet is downloaded to create a dashboard on.

3. Standardizing data

Because the table is in a pivot format and the only goal is to model the relation between students and projects. The wide-format table is converted to a long-format and empty values, that indicate that a student is not part of a project, are removed. The result of it is a long format table with students and the projects they participate in.
 
This long-format table is then converted to a network graph. This allows me to model the relation of students to projects. More importantly, it allows me to convert the table to a bipartite graph. This gives me the ability to remove projects and directly model the collaboration network of student to student.

4. On network theory

Now that a data model is created, I can look into creating metrics to define at risk students. For this it is crucial to lay a foundation from which we can have a common vocabulary.
 
A network graph consists out of 2 components. The first component is a node, this is an individual description of data that could describe its relation to other nodes. An edge is the relation between 2 nodes. In this case, that is the relations between students and a project.
 
The network is a bipartite graph, this means that there is a different set of relations. In this case, a student is connected to a project and vice versa. But based on the data model, it is not possible for a student to have a direct edge with another student. This means that the data consists out of 2 separate data types, making it a bipartite graph.

The relation of student A to C could be explained as having to go through 1 edge instead of 2. This is because a project is a stand-in for interpersonal collaboration. The project can be removed to model the relation of a student to a student. This results in a model where student A and B are directly connected to one another.

Student Network Optimization
Bipartite graph example

5. Metric development

Now that I have a bipartite graph of student to student collaboration. It is possible to model the relations between students and their importance in the network. To do this, the aggregator metric for rankpage is used. Rankpage calculates the importance of one node in relation to its edges. This allows me to model how individual nodes affect the entire network, especially how important they are. Since rankpage uses a metric of 0-1, direct metrics can be put in place to calculate at-risk student. The first metric developed for Delta students is prominence. Prominence describes how important one node is to the entire network, if it is below a pre-set value, a student is considered at risk.

The second metric created is the average network similarity. To do this, I calculated the Simrank similarity of all students in the network. Then I take the average of simrank of a student to other students. The result is a metric that describes a set of students that are at risk of having a too similar collaboration network. In this case, the metric selected was 0.3025 based on manual testing.

6. App development

The result and metrics of the analysis were finally put into a dashboard where students can look themselves up and get back what their score and a suggestion of collaborations.

The collaborations were obtained using a similar method and calculating the bottom 5 similar students (I.E students not like the target student). These are then given back as a suggestion of students to work with.

The dashboard also shows the metric for network diversity. This is done by calculation of the diversity metric without the boolean operator. This value is then subtracted of one to get a range of solutions. In addition to this, lower values further away from the key metric are exaggerated to motivate students more to contact other students. Nudging students to engage with students they do not work with.

The final result is the Student Network Optimization dashboard that you can see below:

Conclusion

Student Network Optimization focuses on connecting students to one another. This product has focussed on providing an aggregate metric per student to gain insight into their risk and possible people to work with.

Feel like this could help your organization or school? Get in contact with me to talk about the possibility of implementing Student Network Optimization for your organization!

Sources:

Want to modernize your processes?​