Reputation System

Company: Shopify
Role: Senior Product Designer
Type: System Design

Expert profiles helped differentiate partners, but comparison was slow and lacked trust signals; leaving merchants uncertain, limiting expert credibility, and weakening platform governance. I designed a multi-dimensional reputation system that captures merchant feedback at the right moments in the job lifecycle, accelerating first contact and job creation, increasing merchant trust, enabling expert ranking, and clearly signaling partner expertise through service-specific ratings.

Problem

While profiles were a good start to differentiate Experts (Shopify freelancers), comparing profiles was time-consuming and lacked trust signals for the merchant to make a confident decision. In turn, experts couldn’t create clout and Shopify lacked the ability to govern the platform efficiently.

Solution

A multi-dimensional reputation system that captures merchant experiences at the right moment of their journey, and respects the core loops of successful and failed jobs.

Impact

Acceleration of first contact and job creation
Increased trust and expectations by merchants
Creation of a ranking system for better expert surfacing
Clear signals of the expertise of partners via service specific ratings

Multidimensional reviews

Jobs can be complex, so we provided a low cognitive glance of its aspects. Experience through written text, cost expectations, what other services the job might have entailed, and a judgement of the dimensions of a service: communication of the expert throughout the job and the quality of their final work.

The final solution featured a two-dimensional score visible via tooltips

Ranking system

The reputation system was key to the creation of ranking in the platform. We used the different triggers that are hit through the system’s flow to obtain explicit data like job completion and user ratings, as well as implicit data like fizzled jobs and churned communications.

The star rating was not the only data point affecting ranking

Service expertise

The reputation was service-aware, allowing merchants to clearly observe where an expert had most of their experience (jobs completed) and where their expertise is (ratings).

Reputation was service-specific, reflecting each expert's skills and services

What is a job?

The main challenges of the reputation system was to determine what a an experience was, in other words, what a job was. For that, we needed to answer these questions:

When does a job start? When does a job end?
What is the green path? What are other paths a job can take?
When is the right moment to ask about the merchant’s experience?

The back-end of communication system was complex, so we needed to illustrate clearly the merchant’s flow through a job to confidently answer those questions and decide where each reputation trigger would reside.

Iteration

Our first iteration was purely theoretical which even passed validation testing. Only after release we were able to observe real behaviours, which shone a light on issues in the system. From there we were able to pivot quickly and make empirical changes.

Expectation vs reality of the flow of a job

Quickly pivoting

Unforseen issues lead to system gaming and feelings of unfairness. We had to let go of the expected job journey, but instead ask ourselves “What makes a merchant entitled to a review?”, thus changing what we consider a job started and a job ended.

what the job experience looks like in detail in our system

Explicit triggers

The reputation system relied on explicit actions and input by the merchant. So we needed these inputs to:

Appear at the right time - when a full experience has concluded and the merchant is not interrupted.
Appear in the right place - Noticeable and understandable within the context
Strike a balance - require enough effort to obtain valueable information, but not too much that becomes a chore for the user.

Striking a balance for reviews

Jobs are complex and unique to each merchant. We wanted to obtain as much relevant information as possible to bring value back to the top of the funnel: other merchants looking for an expert to work with.

Here you can see that our initial explorations required a lot of input. The intention was to bring objectivity to reviews by capturing information first from experts then the merchants. Instead, it created a bottleneck on the expert’s side.

As we iterated, we were able to pair down the amount of information we were asking for. We had removed the bottlenecks and reduced the overwhelming feeling of a large form by displaying everything without pagination.

We started exploring different interactions, like dismissing the review and attuning subjectivity by procedurally revealing content. But from experience with the App Store reviews, we knew that merchants are easily overwhelmed. Thus, we needed to continue pairing down the content to reduce the visual noise and perceived complexity.

The final design succeeded in asking for the right amount of information without overwhelming the user. It prioritized required inputs to be able to obtain value when done at its minimum. It proved to be timely by appearing at the end of the job experience, and in the area where the focus of the merchant was (main communication area of a job’s page)