Hi everyone,
I’m a PhD student in Computer Science researching why people choose to self-host software—what motivates you, what concerns you, and what factors affect your decision-making.
To better understand this, I’ve prepared a short anonymous survey (~10 minutes). Your insights as part of the self-hosting community would be incredibly valuable for this research.
🔗 Survey link: https://survey.lpt.feri.um.si/376953?newtest=Y&lang=en&s=ls
This study is part of my doctoral research at the University of Maribor, Slovenia, conducted under the supervision of Assist. Prof. Lili Nemec Zlatolas, PhD. All responses are anonymous and used strictly for academic purposes.
If you’ve ever self-hosted anything—or even just considered it—I’d really appreciate your input.
Thanks a lot for your time, and feel free to ask me anything about the project (luka.hrgarek@um.si)!
Cheers!
I’m old enough to consider the framing of the question to be weirdly loaded.
It does not feel that long ago where people would be asked to justify entrusting their product’s functions and data to a bunch of strangers who can make unilateral decisions about your service with zero comeback. Now we’re being asked to justify not doing that.
Thank you for your comment. The use of similar statements is a common practice in survey research, as it helps to capture various dimensions of a construct more reliably and provides a clearer understanding of individual perspectives.
Regarding your concern, the purpose of this study is not to ask anyone to justify or defend their choices, whether it’s about using third-party services or self-hosting. Instead, we aim to identify the factors that influence such decisions, from a scientific standpoint, to better understand the motivations behind them. The goal is not to judge whether one choice is better than another, but to gain insights into the different considerations that shape people’s decisions when it comes to managing their data and services. Thank you again for taking the time to complete the survey.
Sure, I’m just bemoaning the fact that you’ve taken cloud hosting to be the default. It’s as much a complaint about the world in general as anything specific to you. Good luck with it all.
Totally understand your concern, and you’re right, the assumption of cloud as a default can be frustrating in many ways.
That said, this framing partly reflects the state of the academic literature: in the past 10–15 years, cloud adoption (especially SaaS) has been extensively studied, to the point where it often feels “default” in research too. In contrast, self-hosting has been far less explored, which is exactly why we’re doing this study—to help fill that gap and highlight its relevance, especially in academic contexts.
Thanks again for your thoughts and for the good wishes! :)
Yeah, those data questions are really loaded. I don’t host for privacy or what not. It’s because of a learning objective, to study, experiment, and run automated stock trading algorithms. I don’t exactly have anything to hide from private companies.
Thank you for your comment. The use of similar statements is a common practice in this type of research, as it helps to better capture different aspects of a construct and ensures reliability. I understand that privacy may not be your personal motivation for self-hosting, and that’s perfectly fine. The purpose of this survey is to explore a variety of factors that can influence why individuals choose to self-host, and to determine the relative importance of each. Even if certain factors don’t apply to you, your responses contribute to a broader understanding of the motivations behind self-hosting. Thank you again for taking the time to complete the survey.
If I have a file, I have it.
If google has my file, they say they have it. I’m told it’s there. For how long? I dunno. Private? Hell no. Forever? Likely not.
This small discrepancy is the entire drive behind me selfhosting.
I’m a minimalist with selfhosting, a raspberrypi with a vpn connection, syncthing and a samba share is all most anyone really need-needs.
I downloaded music I bought online and copied it to my Google drive once. This was years back, mid 2010s, this album just came out for my favorite artist back then. I’d downloaded it back to another pc and a week later - poof. No more mp3s. @.@
Edit: just that folder of that album’s mp3s, not my whole music library back then, just to be clear. Still, that was my first big burn from cloud services.
Also there’s that a file on a cloud service might change. E.g. Amazon sometimes updates ebook covers to advertise that there’s a show - even for those who have paid extra to have the ad-free option.
E.g. the sticker-type graphic on this and that the title is updated to “The Fires Of Heaven: Book 5 of the Wheel of Time (Now a major TV series)”:
Have you thought about contacting Louis Rossmann? He created an extensive video guide on how to self host using FOSS. Perhaps he’d be willing to highlight your survey to his over 2 million subscribers.
That’s a good idea, and maybe even Henry from Techlore.
Done. Nobody else wants to know why I have 3 RasPi’s running stuff around the house, so I get to tell you in the survey, lol.
I want to know 👉👈
I had a hard time answering these because my opinion on cloud service depends on the cloud. (Google vs nextcloud for instance)
Done.
I feel that it may be helpful to try to capture the motivation for each type of service.
For example. The reasons I host a media server are different than the reasons I host a photo backup solution.
Thank you for doing this research. I dream of the day that self hosting becomes as easy as spinning up a consumer router.
I self host for the same reason I’m not clicking some random link: distrust lol
um.si is for University of Maribor in Maribor, Slovenia. It looks legit.
As a fellow comp sci graduate (different uni, long time ago) that doesn’t fill me with a ton of confidence lol.
This could actually be a study on phishing lol
Or a less ethical study on virus propagation
Or maybe he has just gone rogue and his university hasn’t noticed because they probably don’t actually monitor what students are hosting very closely, as long as it’s not causing problematic network traffic.\I doubt it, but I’m still not gonna click a link that someone is asking me to click lol
If you’re that vulnerable to shady URLs, you may want to rework your blockers or even spin up a VM. If you’re that venerable you phishing, just don’t give them your numbers.
Why risk it?
Can’t limit yourself to twitter, facebook and google. Hell, I think those sites are the riskier ones.
Those are known evils I know how to deal with, that are popular enough that there are foss tools to handle them.
But I’m being a bit hyperbolic here. I’m not as paranoid as I’m portraying myself… I just don’t have any motivation to click this particular risky link.
I was kind of surprised by my answers when I stopped to reflect. I realized I:
- don’t really like self-hosting
- know a lot about new tech, but am not very excited about it
- don’t use a lot of the popular services
Anyway, I hope the results are useful! I don’t know if you’ve done it already, but it would be interesting to compare results from different sources, like Lemmy vs Reddit or wherever you posted it.
Because I dont need to pay rent for my files and I don’t have to worry about AI and VCs trying invade my privacy.
Done. Are you going to be sharing the results here? That would be cool.
Thank you very much – I really appreciate your participation! Yes, the results will be published as part of my PhD dissertation, and also in one of the peer-reviewed journals in the field of Computer Science. Once everything is finalized and publicly available, I’ll definitely share a summary and a link to the publication here as well. Thanks again for your interest and support!
Awesome. I’m very curious about your findings. Looking forward to it.
I submitted a response but if i may give some feedback, the second portion brings up:
I am willing to pay a substantial amount for hardware required for self-hosting.
This seemed out of place because there were no other value related questions (iirc). Such as:
- I believe self hosting saves me money in the short term
- i believe self hosting saves me money in the long run
I’m sure you could also think of more. But i think it’s pretty important because between cloud service providers and any non-free apps you want to use, it can be quite costly compared to the cost of some hardware and time it takes to set things up.
The rest of my responses don’t change but if you’re wanting to understand the impact of money in all of this, i think some more questions are needed
Best of luck!
My only quibble would be to swap “pay” for “invest” which captures both the dynamic of up front expense and expected savings from ending recurring subscription fees. That’s how I look at it. Every penny I put into my own digital sovereignty is an investment that will yield returns both financial and otherwise.
Not to mention that a lot of self-hosting can be done on hardware you already had laying around.
I saved money by stea- I mean borrowing - work equipment
Second this - so far it has cost me money, but as I am able to cancel more subscription services, the savings will add up.
And I self-host precisely because of the money I save using surplussed hardware. I have a symmetrical 1Gb SOHO fibre connection from my ISP, so I can host whatever the hell I want, I just need to stand it up. And a beefy older system with oodles of RAM is perfect for spinning up VMs of various platforms for various tasks. This saves me craploads of money over even a single VM on cloud platforms like Vultr. Plus, even if I were to support a “heavy” service sufficiently in demand to warrant its own iron, it still costs me less than a year’s worth of hosting to obtain a decent platform for that service to run on all by it’s lonesome.
My only cloud costs end up being those services which are distributed for redundancy and geographical distance, such as DNS and caching CDNs.
Submitted.
I am in finance/accounting/consulting and I run one self hosted small server at home to manage my home machines remotely and use it with combo of proton to do most things.
While I am in the process of de-googleing I haven’t self hosted photo or email yet as proton has been working okay for the time being. One step at a time I guess for me… I am not in IT so it requires time for me to read up on things and set them up (rtfm indeed)…(yes we all know what proton CEO said… )…let’s see how it pans out.and this has been quite a journey for me. I am still having issues with family and friends not wanting to use jitsi (that I am using for calls or contact me on signal). I have basically thrown WhatsApp and anything I don’t care about within beeper under a work profile (android) I.e. in a sandbox / appimage (laptop).
Personally if next cloud allowed self hosting with self email hosting (whatever the technical terminology is) that would have been (chef’s kiss) muah…I would have jumped on it first hand…
Sorry I am not in IT so my jargon usage might be sub par.
Honestly, email is the one thing I wouldn’t self host, it’s just so much hastle getting it set up right and can be incredibly difficult to have emails you send not end up in spam. Just not worth the hastle imo.
Thanks, much appreciated!
Done but I felt lots of questions to be very similar. Maybe there is a form platform that can show only a subset of control questions for every survey.
Thank you for completing the survey and for your thoughtful feedback. The similarity between some questions is intentional and follows common scientific practice when measuring complex or abstract concepts. Using multiple, slightly varied items that target the same construct increases the reliability and validity of the data by capturing subtle nuances and reducing the influence of random response variation. While your suggestion to show only a subset of such items through adaptive platforms is valid and worth exploring, fixed item sets are generally preferred in research settings to ensure consistent and robust measurement. We appreciate your input and will consider it in future survey design improvements.
To be honest, if 3-4 questions in a row had same-ish wording, I just replied the same thing 3-4 times.
As designed. It proves you actually read the question.
- It’s educational for those who have a lust for learning.
- It’s fun.
- It’s far more private than using commercial cloud services.