Characteristics of an effective data scientist

Recently, people dealing with Data Science are one of the most sought-after specialists in the labour market. The demand for data scientists is already demonstrated not only by IT industries, but also by companies
that have not yet dealt with the analysis of large information resources.  

What competencies should a data scientist have? What should characterise him or her? Are there competencies without which this profession cannot be pursued? The best specialists are described by certain features that distinguish them from the crowd of ordinary analysts.

When we think of employees in this field, we usually look at analytical qualifications. Much less often “soft” skills are mentioned, which are also very important in this industry. Before they start working, data scientists must master
several important skills without which working in this interdisciplinary field would not be possible.

There is my (subjective) list of qualities that every good data scientist should have, and we will start it a bit puckishly
with something that hardly anyone mentions at the beginning, i.e. communication skills.

1. COMMUNICATION

In the work of data scientist it is extremely important to communicate skilfully. The key to success is effective communication during each stage of the project. The specialist has to communicate accurately while defining the problem, analysing it, solving it and also during presentation to other team members. They need to be able to talk to both data scientistsand people who don’t know the industry very well. Explaining data to non-analytical team members is usually the most complicated stage of the communication process during the whole project. It has to be done in an understandable way and using simple language so that team members without technical preparation can assimilate the subject. It is important to be able to explain complicated things in an easy way. Data scientist must remember to adapt the way of communication to the capabilities and needs of their recipient.

2. KNOWLEDGE OF ALGORITHMS AND METHODS

Theoretical and practical knowledge of algorithms is also a solid foundation for understanding and learning new approaches. Scientists involved in data analysis should be able to easily assimilate new methods. Big data is a
topic that is currently in full bloom and will certainly uncover many more of its possibilities. What we know today, tomorrow may turn out to be outdated by new solutions, so an effective data scientist should easily use new,
previously unknown methods and solutions. Studying and using algorithms in practice develops intuition, which contributes to more effective problem solving.

3. PROGRAMMING SKILLS

This may seem obvious, but it is a basis that cannot be forgotten when completing a list of skills of data scientist.
A good knowledge of at least one programming language is a must: R or Python, and, importantly, having the ability
to quickly assimilate
, learn and use new unknown tools. The data science industry is growing fast, so effective data scientist can’t stop learning. (S)he should constantly expand their programming skills. The world of data science tools is
changing extremely fast. A good data scientist should be willing to learn and gain new qualifications.

What is more, it is a feature that distinguishes the data scientist from an ordinary analyst. The data scientists must program their solutions to work automatically. Analysis of huge data resources cannot be done on a piece of paper,
so knowledge of any programming language is essential to be able to work efficiently.

4. SQL KNOWLEDGE

Another feature, associated with the previous skill, is knowledge of SQL. One may wonder if this shouldn’t be treated simply as one of the programming languages. But I want to underline its special role. Despite the flourishing NoSQL databases and various alternative ways of storing large data sets, SQL is still irreplaceable in many solutions. It is the most effective way of acquiring and preparing data for further work. Knowledge of SQL is an essential competence of data scientist due to frequent work with huge databases.

5. CURIOSITY

Data scientist must be curious, ask questions and search answers, dig deeper all the time. They should be open to new experiences and constantly looking for better, more modern solutions. The data science industry changes from day to day. Better and better solutions and more interesting methods are being developed. While working in this profession, you have to constantly develop your skills so that one day you don’t find out that you’ve slept through an innovation without which you’re not as productive as you used to be. What counts in the industry is the effectiveness, efficiency and simplicity of solutions, so unsatisfied curiosity is something that accompanies every productive data scientist.

6. SCEPTICISM

Another feature that should characterise a data specialist is common sense. It leads to a rigorous approach to the analyses performed, their verification, a thorough check whether the “discovery” made does not result from an error
in the data, in the applied methodology or in the interpretation of results. You can’t be sure of anything in life, that’s why a data scientist should be scrupulous about what (s)he does and take all results with a ‘pinch of salt’. Scepticism allows you to be critical even of your own work results, which can eliminate the possibility of error.

7. BUSINESS APPROACH

To understand the problem to be solved by a data scientist, it is good to know a little about the business the query is about. Understanding the wider context and conditions in which business operates is often necessary to achieve the best results. Without this it is difficult to find the right method to solve the problem.

Summary, an effective data scientist must combine analytical, programming, business, and soft skills such as ease of communication. Not without reason, the data scientists are the most sought-after specialists in the labour market.
It is not easy to meet all the requirements, but many of them can be learned through continuous improvement
of one’s work. However, if you meet all the requirements to be a data scientist, you can contribute to innovative problem-solving in many industries, and finding an interesting job with many career opportunities will not be
a problem at all.

How much more valuable is
a customer who trusts you?

We tried to find an answer to this question together with Maria Galas (IKEA Group) and Adam Wysocki (LVLUP Media) during IAB Forum 2020.

The fact that a customer who trusts the brand is worth more, is obvious. The question is – how much more? 5, 20, 100 times more? There has been no precise answer to this question so far, so we decided to give you one. The challenge was huge as we tried to measure something that is, by definition, difficult to measure –i.e. trust.

Our presentation was based on the results of analysis of the behaviour of real customers participating in IKEA Family program. Assuming that the highest form of trust is recommending the brand to friends, we have analysed the differences in behaviours between two groups of customers: brand promoters and critics. The analysis was based
on tens of thousands of receipts, interactions with thousands of mailings and advertisements in paid media.
The customers who trusted the brand turned out to be more willing to repeat purchases, came back more often and spent more. Additionally, the ROI on investment in advertising contacts with promoters was higher. A similar impact
on these customers could be achieved with almost 4 times fewer page views.

The conclusions of the analysis confirmed the thesis that it is worth taking care of customer trust, and also it is worth measuring it. This makes it possible to better anticipate the customer’s behaviour and to adapt the communication
to their needs and emotions.

Trust in a brand translates into sales, i.e. its specific financial results and how much the consumer spends.
And this is important for at least two reasons:

1. The customers who trust us spend more and more often

2. By building communication to a customer who trusts us, we have lower costs – we can achieve the same effect with less expense.

But what’s the most important, the customer who trusts the brand is the greatest added value.

8.31
Zrzut ekranu 2020-11-18 141239
Zrzut ekranu 2020-11-18 141301
previous arrow
next arrow

 

Why is know-how transfer and effective communication between the customer and the data science company important?

Machine learning and data science are revolutionising all industries and becoming an important element of competitive advantage, determining market survival as well. It seems that today nobody is questioning the sense and need of using data by the companies. Possible discussions that arise in the area of data science concern the model to be used to build competences, and not whether to start the challenge at all. Two approaches clash here. One assumes focuses on creating an in-house team. And the other one relies on outsourcing. Both approaches have their advantages and disadvantages. Both have their opponents and supporters.

Outsourcing – for or against?

The purpose of this article is not to discuss the arguments for and against outsourcing. I would like to focus on one aspect of it , which often escapes attention in discussions. I think that the properly formulated question is not “should you” outsource, but “how” to do it and what to pay attention to.

The essence of the problem is best illustrated by certain “real life” situations, we had to deal with more than once.
So let’s imagine having a conversation with one of the clients – a manager responsible for advertising:

— We have an attribution model and it helps us optimise budgets.

— Great. Did your team build it?

— No, We had external consultants.

— But you know how this model is built, do you? How does it work?

— Unfortunately not.

Or a slightly different example:

A conversation between a manager responsible for marketing and an external company providing
predictive model building services:

— …and we’d really appreciate if our analyst could work with you on the model.

— Why?

— We would like to understand how it works and develop our competence in this area…

— Hmm… I don’t think this would be possible.

Solving the problem

Unfortunately, such situations are still very common. The consultants fear that once they educate the client, they will become redundant and will lose their orders, so they feel reluctant to disclose their know-how. It may seem that at least partly these fears are justified.  In my experience, almost every data science initiative that has been implemented opens doors to several more. 

On the other hand, let’s look at it from the point of view of the client, in this case, a company looking for support in using data in its business. Can the company afford to outsource 100 percent of something  that is supposed to be its competitive advantage?

Well, the answer to this question is the same as the answer to the question, whether a car manufacturer can outsource the entire department designing new car models… Probably it can, but rather shouldn’t.

With a change in the perception of the role of data science by managers and a growing awareness of the importance of data-based solutions for the achievement of company goals, there is a growing awareness of the need to build internal know-how. When looking to speed up data science projects or optimise costs, companies are considering using external services. The partners willing to share their knowledge are particularly valuable. Advanced projects will be easier to implement with a company that sees client development as an opportunity to strengthen cooperation. 

So how to choose a partner that will want to share its knowledge? Is it possible to check the level of willingness of the vendor to transfer know-how? By asking the right questions early, you can see your potential business partner’s approach to know-how transfer. Getting answers will help you make a more informed choice.

Key questions

1.How will you document the solution elaborated and the way of reaching it? – The lack of appropriate documentation prevents or significantly hinders further development of the tool.

2. How will our employees be able to get involved in the project?  – The possibility of involvement depends on the nature of the project and the client’s competence. In general, however, the greater the involvement of the client’s employees, the greater the chance for the success of the project, successful implementation of the solution in practice and transfer of knowledge from the area of data science.

3. In what form will the solution be provided? – In addition to the deliverable itself, it is also worth receiving documentation
and preferably also source codes that will allow further development and improvement of the solution.

4. What training on the use of the solution do you envisage in the project? – It is worth knowing at the beginning of the cooperation what training courses are planned to make sure they are sufficient.

5. How many of your projects are one-time initiatives, and to what extent are you focused on long-term cooperation? 
– A question that will verify the company’s approach and willingness to further cooperation.

The clients increasingly expect, and even demand, the involvement of their analysts in the process of building and testing data science solutions. This trend will be intensify. Outsourcing combined with know-how transfer may become a golden rule and a solution to a dilemma – in-house or outsource, combining the best of both approaches.

Data revolution

Today we’re living in the 4th Industrial Revolution. Revolution based on data, their processing and drawing conclusions that are important for business.

In recent years we have seen a huge change in the way organisations collect, process and analyse data. We know that although today these increases are already significant, this is only the beginning of a dynamic development. The number of data sources, the amount of data and the possibilities of their connection is increasing. And this, in turn, creates new great opportunities for every company.

What happens to data is a global phenomenon and is particularly evident at different business levels.

Today, every organisation wants to take it to the next level: from basic analytics to diagnostic analytics,
ending with predictive analytics and the highest level, that is forecasting for the future.

The global phenomenon has evolved into a new way of thinking and acting. You can see it in virtually every industry.

The fastest growing companies today are those that are building their position based on data. And it’s not just about the companies involved in Internet business or e-commerce. We see this in virtually every industry – a data-based strategy
allows you to increase your competitive advantage and seize new business opportunities.

Our team has been part of these changes since 2014.

And we have already been awarded many times in the most prestigious industry competitions in Poland.

There are examples of our projects implemented for our clients.

Hourly traffic forecast in stories

Choosing the optimal distribution network

CRM performance analysis

Segmentation of the customer database

Together with over 100 experts in data science, loyalty programs, creative activities and media actions,
we create Data Science House.

To accelerate the revolution of the entire organisation based on data.