13

I have a homepage where I typically put my published manuscripts on together with information about me. I have a tracker/analytics page that tells me which paper is getting more downloads/views, because I am interested in knowing which of my work/projects people are most interested in (citations don’t always equate that, my most “active” paper has a few citations).

A colleague of mine told me this might be perceived by some as inappropriate. I don’t really check countries/individual information of visitors, just an aggregate on a per paper basis.

Is this generally ok or should I deactivate the analytics link?

user3508551
  • 3,888
  • 4
  • 22
  • 33
  • 11
    The web server logs that info, in any case, by-the-way... – paul garrett Jun 24 '23 at 17:26
  • 2
    I'd say it's inappropriate to use any data without explicit consent. (Even if most people won't care.) – user111388 Jun 24 '23 at 19:06
  • 12
    Asking about a "tracker" is deeply ambiguous whether it's a tracker that logs and exposes individual IP addresses, vs a tracker that only reports in aggregate which document got how many hits, in which date-range, from which country or domain (.com/.edu/.net/.tld...). – smci Jun 25 '23 at 02:56
  • 1
    @paulgarrett: To /dev/null :-) – R.. GitHub STOP HELPING ICE Jun 25 '23 at 03:12
  • 6
    Please specify exactly the raw information you are collecting – user2705196 Jun 25 '23 at 15:25
  • 1
    @user2705196 I am not collecting any data. It’s through a third part tool (google analytics) and it shows me the aggregate information (although it does also have country based statistics, etc). – user3508551 Jun 25 '23 at 19:09
  • 4
    @user3508551 Whatever the data is that google analytics shows you is data that you are collecting. Please specify what exactly that data is. I understand that I could just look that up, but it's best if you specify it here so we can decide how ethical, anonymous, and normal collection of such data on your academic website is. – user2705196 Jun 25 '23 at 21:11
  • 2
    @user2705196 For a specific date period, how many 1) Visits on a certain page are there 2) Which countries visited which pages the most (I never look at this as it is irrelevant) 3) The geographic location (countries/cities) of the visitors 4) Whether there was a visitor in the last 30 minutes (called real time visitors), and if so, geographic location (country, but not city), which pages they interacted with. I also get to see if it was "direct" or "referral" meaning coming from Google search, or as a "referral" from one of my co-author pages that link my page (I don't know referral source). – user3508551 Jun 26 '23 at 01:06
  • If there are issues with using a cookie-based tracker, use of something like webalizer that just looks at the server logs of requests should fill your described needs. Just watch the number of times the paper download link is requested and you're good to go. – IronEagle Jun 26 '23 at 03:04
  • 1
    If said homepage is hosted by your employer (probably the university), check with your employer what is allowed and what is not. – Abigail Jun 26 '23 at 05:37
  • From a software perspective, if you want to avoid any issues with browser-based analytics, since these tend to be quite invasive, you could capture all of the information you want in your back-end quite simply – ScottishTapWater Jun 26 '23 at 09:59
  • 5
    I think the question should include the fact that you are using Google analytics. Because having a download counter that does exactly that under your control, or having a data hungry third party analyse the heck out of the visitor across many sites, is fundamentally different. – BlackJack Jun 26 '23 at 11:03

5 Answers5

18

Just be careful you're complying with consent.

Perhaps the most rigorous is the EU's GDPR (General Data Protection Regulation), which states the following about cookies:

To comply with the regulations governing cookies under the GDPR and the ePrivacy Directive you must:

  • Receive users’ consent before you use any cookies except strictly necessary cookies.
  • Provide accurate and specific information about the data each cookie tracks and its purpose in plain language before consent is received.
  • Document and store consent received from users.
  • Allow users to access your service even if they refuse to allow the use of certain cookies
  • Make it as easy for users to withdraw their consent as it was for them to give their consent in the first place.
henning
  • 35,032
  • 10
  • 121
  • 151
Peter K.
  • 4,938
  • 1
  • 26
  • 37
  • 3
    I am genuinely curious if GDPR applies in this case. There is a reasonably broad carve out for personal websites (https://www.visitor-analytics.io/en/blog/what-does-gdpr-compliance-mean-for-personal-websites/); IANAL but this would seem to apply to academic websites. Of course, "analytics" are typically collected by a third party provider such as Google, in which case IDK. – kyle Jun 26 '23 at 18:55
16

A colleague of mine told me this might be perceived by some as inappropriate.

By whom exactly? "Some?" As opposed to what, every single website on the Internet? It does not seem that you are proposing to place a cookie that will track the user's activity across the Internet. You just want to see how many people are visiting your page. Totally appropriate.

Cheery
  • 13,573
  • 2
  • 31
  • 63
  • 2
    Yes, certainly there is a huge difference between logging/counting the IP addresses that visit your site, and planting cookies on somebody's browser... – paul garrett Jun 24 '23 at 21:52
  • 13
    Using a readymade analytics solution will, however, track the user across the Internet, it just reports only per-site statistics to site owners. – Simon Richter Jun 25 '23 at 05:51
  • 3
    Any web server will generate a log in a standard format, and there's 30 years' worth of scripts for analysing those. There is absolutely no need to start messing around with cookies, much less "social media" trackers. – Mark Morgan Lloyd Jun 26 '23 at 06:26
  • @Mark Morgan Lloyd: Yes, assuming the web server logs are accessible. That is not the case for many hosting solutions, e.g. shared hosting. – Peter Mortensen Jun 26 '23 at 08:57
  • @PeterMortensen in that case find a competent ISP. If research etc. is published openly, which is after all the tradition in academia, then it's debatable whether the author has the right to find out who's read it (unless they publish derivative work) let alone Google et al. – Mark Morgan Lloyd Jun 26 '23 at 09:21
  • You're not proposing planting a cookie; you're just proposing the execution of third-party Javascript that will likely fingerprint their browser and send information to some mother ship somewhere. – Kaz Jun 26 '23 at 15:12
9

Is this generally ok or should I deactivate the analytics link?

It depends in part on what you mean by a "tracker":

  • If you're really just counting page views (that is, tracking views rather than viewers) then I don't think anyone would object.

  • If you're logging and analyzing information that users provide anyway (the IP addresses from which requests originate, for example) then I don't think most people, academic or otherwise, would object, but you may be on shaky ground with respect to some privacy laws, such as the GDPR. In this case, you should consider deactivating the analytics for your own protection.

  • If you're installing tracking cookies or using similar technology to identify visitors and track their activity then that's not ok with me, and probably not ok with the EU (GDPR), and maybe not ok with various other countries and their privacy laws. Probably my browser foils your attempt, but I would consider it rude that you tried. In this case, you should deactivate the tracker for your own protection, for your reputation, and because it's the right thing to do.

John Bollinger
  • 211
  • 1
  • 4
1

It's your homepage, you can put whatever you want on it.

There does, in my experience at least, tend to be a higher concentration of 1) people who understand the privacy issues of trackers, and 2) people who care about trackers, in acadaemia but there's no general consensus that this is unacceptable.

Anyone who has a problem with internet trackers probably has browser extensions to block them anyway, and they're always free to not use your website anyway.

Just make sure you have the appropriate consent notices.


As a side note, you could capture all of the information you mentioned in your back-end quite comfortably and avoid needing any sort of browser based tracking entirely.

ScottishTapWater
  • 2,247
  • 9
  • 23
  • 1
    I'd appreciate some context as to the downvote so I can improve this answer... – ScottishTapWater Jun 26 '23 at 12:23
  • 3
    I'm guessing people don't agree with the "It's your homepage, you can put whatever you want on it" statement. There are many things that are illegal that you can't put on your homepage. In particular, for European visitors, it is illegal for you to collect certain type of information about them without consent. – Cris Luengo Jun 26 '23 at 15:22
  • 1
    @CrisLuengo - I did say that you need the appropriate consent notices... – ScottishTapWater Jun 26 '23 at 15:35
  • 3
    Many things are illegal with or without consent. – Cris Luengo Jun 26 '23 at 15:37
  • 1
    I would have assumed that that goes without saying... It's obviously unacceptable to host illegal content – ScottishTapWater Jun 26 '23 at 15:38
  • The question isn't about whether this is legal, it's about whether it's appropriate. – kaya3 Jun 26 '23 at 17:54
  • @kaya3-supportthestrike - Not sure if that was aimed at me or Cris... But my answer addresses whether it's appropriate or not with "there's no general consensus that this is unacceptable" – ScottishTapWater Jun 28 '23 at 11:25
-8

EDIT 2: I have just read that you are using Google Analytics. I did not realise that was the case when originally reading your question. I think that you should mention this in your question, because it is very different to what I had imagined when originally reading it. In this case, I would most definitely recommend that you have a disclaimer that you are using Google Analytics on your web page. Google Analytics, as others have said, sells data to third parties, that's why the service is free. There's no such thing as a free lunch, as they say. I'd recommend you use something other than Google Analytics to track your downloads. How are you hosting your site? Is it through a paid server, or are you hosting the server yourself? There are plenty of ways that you can analyse how many downloads of your papers are occurring without resorting to Google Analytics.

EDIT: (After received constructive criticism)

I personally do not view it as inappropriate, so long that you are collecting anonymised data, and not logging the IP addresses of individuals who download your papers. If this is the case, you can have a simple message saying "I compile anonymised data relating to the download and view counts of individual papers, to help me determine which of my papers garner greater interest, which guides me to understand which of my topics people are most interested in. I do not log the IP addresses associated with these downloads nor do I log any other personal information." If this is the case, I believe that it is not an invasion of privacy, because you aren't logging who has read your papers.

I find it strange that an individual would state that people would find this inappropriate when it is a standard feature of virtually all cloud hosting services, and (assuming it is anonymous) does not seem to me to constitute an invasion of privacy, because you would not know who any of those people were. Regarding the country of origin, that could be beneficial to know which countries are most interested in your research, which would be relevant for obtaining grants etc. I don't think it would be an invasion of privacy, because countries comprise large numbers of people, and you would not know which people specifically were reading your papers (again, given that you do not track the IP addresses).

Javed Alam
  • 23
  • 1
  • 8
    Downvoted because this answer seems to be an opinion regarding the acceptability of internet tracking in general, rather than an answer to the question. The last paragraph regarding your personal life obviously has meaning to you but is entirely irrelevant to answering the question. – traktor Jun 25 '23 at 12:04
  • This answer treats the question as if it is a simple yes or no answer ("either it is inappropriate or it is not"), but offers an opinion, rather than a real answer. I agree that the last paragraph has nothing to do with the question. – BioBrains Jun 25 '23 at 15:47
  • 7
    Despite the moderation strike, I removed the wildly inappropriate final paragraph (for anyone reading traktor and Bio's comments now). Accusing an anonymous person you've never met of gaslighting is certainly something. – Azor Ahai -him- Jun 25 '23 at 18:03
  • I apologise for answering in a way that obviously was not in line with the regulations of the forum. It was my first post ever here. What does the moderation strike mean? Yes, I posted my opinion, I didn't realise that wasn't what people do on forums? Are people supposed to have citations that refer to other people's peer reviewed opinions?

    I am against tracking on the internet, but having anonymised statistics about how many people download your paper is not tracking, is it?

    – Javed Alam Jun 27 '23 at 05:47
  • The last sentence, in retrospect, was inappropriate, however it was relevant to the question. The OP stated that their colleague stated that "some" people may find it inappropriate. I was trying to illustrate with an analogy that the term "some" is vague, and is often used to refer to a minority (e.g. "let's go to the supermarket and buy some bread) - are you going to buy every loaf of bread in the store? Highly unlikely. – Javed Alam Jun 27 '23 at 06:01
  • I am confused about what the moderation strike is, and I am also confused why writing such a down-voted post gained me 11 reputation. It can't be because this is the first time I have answered a question, because I have answered questions on other areas of StackExchange, and have not received reputation for it. How does this system work? Does it mean that the up-votes count more than the down-votes? – Javed Alam Jun 28 '23 at 10:37
  • So I just read how the reputation system works. If you get an upvote, you get +10 reputation. Downvotes give -2 reputation. I received +12 reputation (the +11 was an error). With a score of -9, this means that: 10x - 9*2 = 12, 10x - 18 = 12, x = 30/10 = 3. So 3 Upvotes and 9 downvotes gives a positive reputation. I find that interesting. Thank you to those who upvoted me, I am learning and I appreciate the kindness. EDIT: I just realised that if it were 3 upvotes and 9 downvotes then the overall score would be -6, not -9. So my maths is incorrect. Thank you for constructive criticism and votes – Javed Alam Jun 28 '23 at 10:52
  • Also, is this comment invisible now? Because no one seems to reply to my questions or edits to the content of the response @AzorAhai-him- – Javed Alam Jun 28 '23 at 11:00
  • @JavedAlam We do not receive notifications unless you @ ping someone. – Azor Ahai -him- Jun 28 '23 at 12:59
  • @AzorAhai-him- could you please explain what you mean by the "moderation strike"? Thank you for your assistance. – Javed Alam Jul 12 '23 at 02:13
  • @JavedAlam Many diamond moderators and high-rep users are refusing to perform moderation actions (such as close voting, or editing content) in protest of SE's generative AI policies. I am participating, but felt that your answer was egregious enough to edit in spite of the strike. You can find links in the meta sidebar. – Azor Ahai -him- Jul 12 '23 at 22:07
  • @AzorAhai-him- I apologise, as a new user, I did not realise there was a strike. Apologies, and stand strong. – Javed Alam Jul 13 '23 at 04:18