July 6, 2022
Andrew McKenna-Foster
Reporting on a repository’s contents and reuse is an essential component for assessing impact and value of the repository on the institution as a whole. This webinar will outline multiple ways to gather statistics that can be shared with researchers and administrators.
Please note that the transcript was generated with software and may not be entirely correct.
0:32
Hello, everyone. I'm just going to wait a few more minutes
0:39
to give time for other folks to come into the room, and then we'll get started.
1:29
OK, I think I'm going to get going here. Thank you, everyone, for joining today.
1:37
My name is Andrew McKenna-Foster, and I'm a Product Specialist at Figshare.
1:41
I'll be conducting the webinar today.
1:45
I think I'm going to turn off my video; you should still be able to hear me.
1:53
I have a few housekeeping things. This is a 30-minute webinar,
1:58
so it will go by very quickly. If you have questions, please put them in the question box
2:07
in GoToWebinar.
2:08
If I see them, I'll just answer them immediately, but otherwise I have reserved a little bit of time at the end to answer questions. Today I'm going to give an overview of all the ways that you can collect information from your Figshare repository
2:27
to report on and understand what's going on with the repository.
2:33
A few things first: I want to mention that we have quite a few other webinars coming up. I've just cherry-picked a few of them here.
2:42
So you can see the role of funders in publishing and sharing research data, publications, and conference materials coming up in August.
2:49
How Figshare integrates and augments research information management systems is coming up in September.
2:54
We have the State of Open Data 2022 presentation
3:00
coming up, planned for October. I want to mention that the State of Open Data 2022 survey is still open, up until July 18th.
3:08
This QR code will take you to the survey page.
3:12
So, please share widely with researchers or anyone who might be, you know, trying to share data openly.
3:21
We will close the survey on July 18th and then hopefully present the results in October.
3:30
And then another webinar coming up in November is The Figshare API for End Users.
3:39
The audience for this webinar is anyone who's managing a Figshare repository. So, if you are a researcher on figshare.com with a free account, this webinar is probably not going to be the most useful for you.
3:53
But if you are someone who's running a repository based on Figshare, or someone who needs information from that repository, this should be a good source of information for you.
4:09
So, the specific things I'm going to cover today: I'm going to mention, of course, the administrative stats dashboard that's available to administrators and to the reporting user role.
4:22
I'm gonna mention the optional public dashboard.
4:25
You can optionally make it public, but it's always available to administrators.
4:29
I will also mention the user report that's available to administrators and briefly cover the API statistics endpoints. Then I'm going to end by talking about ways you can gather information using the new Batch Management Tool that we recently launched, I think just in April, and ways you can report on all that metadata. I do want to mention that Figshare pulls in information from Altmetric and Dimensions,
5:00
two of our sister companies within Digital Science.
5:05
So I will mention using some of that information, and of course, if you have a subscription to Altmetric or Dimensions, that just gives you even more reporting power.
5:18
So without further ado, I'm going to talk about statistics using the Team Figshare repository. This is a part of, you know, the Figshare repository that we at Figshare use for slides, presentations, and online resources related to Figshare and the State of Open Data.
5:40
As you can see, I'm logged in as an administrator. And because of that I have a statistics link here so this link is available to me as an administrator.
5:51
And also if I had the reporting role as a user in the repository and I was a reporter, I would be able to access this stats dashboard.
6:02
So, this is based on the Kibana platform, ..., and you can look it up if you want even more details on the platform itself.
6:14
As a quick orientation, we can select what stats we want to see in terms of the time duration, and there are relative and absolute options here.
6:29
Then, the rest of the dashboard is made up of these frames, which can be charts, maps, tables, or just numbers.
6:38
And a big takeaway here is that we can customize this for you.
6:43
So, the dashboard that we have here in Team Figshare is basically just our basic setup.
6:49
But if you want, you know, this chart up here, This is the count of views by the ... Share item types.
6:56
If you want this to be something different, that can most likely be done. Just send a support ticket to support@figshare.com
7:05
and ask,
7:06
and they will tell you what's possible. Perhaps, you know, this could be the count of items instead of a count of views.
7:14
We can even change some of the charts. On the right, we see the number of items in categories over time.
7:23
And it's the views for each of those categories. So there are lots of opportunities to gather lots of different types of information. Just a few other things.
7:32
Everything in this dashboard is downloadable as a CSV and so there's these little buttons that appear at the bottom left.
7:42
And we can see all the information in a table format, and we can download that in the raw format or the formatted format.
7:52
And if there's a key or a legend, that can also appear there too. These stats are updated hourly.
8:03
So, on this map, I'll zoom in here for it to show up.
8:08
The information on the map is updated hourly as well.
8:11
This map shows views and downloads. Of course, you can have this show just views or just downloads. To make this a little more interesting, I'm going to select the last five years.
8:24
We don't use this repository heavily.
8:26
So not all the statistics would be that interesting if I didn't select the last five years or so. We have a table of the count of views coming from different countries.
8:38
And you can see some of the other charts available: a pie chart of, in this case, uploads by item type in the repository.
8:46
Another bar chart shows uploads by group,
8:49
at least the top 10. And here are examples of just straight numbers. So we have the number of deposits, number of depositors,
8:59
number of bytes used, total views, and total downloads.
9:03
And then, at the bottom, we have our tables with statistical events and uploads.
9:09
And once again, all this is downloadable as a CSV file.
9:16
There are some other options that we don't have set up in Team Figshare right now. Word clouds are a possibility; area charts and heat maps are also a possibility.
9:28
So, if there is something you'd like to see in your dashboard, just send a ticket in to support, and they can tell you if it's possible and get that set up for you.
9:40
And as I mentioned before, if you just want to change something, you know, if you want to change this from number of depositors to something else, or change this from uploads by item type to something else, that's very easy to do.
9:54
So, this is the administrative dashboard. You have to be an administrator to see it. There's another dashboard that you may have publicly available on your repository. Here's the Royal College of Surgeons in Ireland: they have the statistics visible and a link to more statistics.
10:10
And if we click on that link, it brings us to a public page that has some of the same information that we just saw, but also different information.
10:19
So, we can narrow down these statistics by group or by item type.
10:27
And again, we can change the timeframe, and we can see views and downloads over time, another version of the map.
10:34
Top items, top groups, categories, and top referrals.
10:41
So you could also, you know, you can have top referrals in your administrative dashboard, if you want it as a word cloud, for example.
10:48
Now, some repositories don't have that publicly available. Iowa State University doesn't have that publicly available, but you can still get to it if you're an administrator. Log in to your repository and add /stats at the end of the URL. It won't work for me here, because I'm not an administrator.
11:07
It'll take you to a page and I can show that here. So I'm just going to take the ...
11:13
Figshare URL and add
11:19
/stats.
11:22
And now we can see that the Team Figshare extra dashboard is visible to us, but not publicly.
11:33
So those are the user interface reporting options.
11:39
But there's a lot of other information sources that might be useful for reporting.
11:45
And one of them is available to administrators
11:52
under the administration page: the user report.
11:56
I go to the Users tab, and I can select all or some of my users here and download a user report. This is only really useful,
12:07
of course, if you have users in your repository: if you're setting up accounts for researchers, or they're setting them up through single sign-on.
12:14
Or perhaps you have a user account that manages and owns all the items in various groups, you know, electronic theses and dissertations, open access papers, or something like that.
12:27
You could use this as a way to get reporting for those groups as well. So I've actually already downloaded this and pre-opened it.
12:36
This is what it looks like.
12:38
You have a list here in the Accounts tab; you might have a much longer list than we do.
12:44
One quick note: when you download this report, the columns are usually frozen. So you'll probably need to go to View in whatever program you're using and unfreeze the columns
12:56
so that you'll be able to see all the columns.
12:59
Rather than scrolling across here, we have some field definitions here as well. I actually added this into this report, but this set of definitions is available online.
13:14
So, just super briefly, there's information about the account itself and all the items, both public and private, that they own, and whether they're embargoed.
13:28
It will tell you the number of co-authors on items.
13:32
So, in this example, let me scroll to that.
13:38
Um.
13:41
Here we go: 125 co-authors.
13:45
So it doesn't actually list out the co-authors, but it'll tell you the number, and you could sort the sheet by who has the most co-authors. It also tells you how much space they're using storage-wise, and then information about projects.
13:56
Projects are collaborative spaces, both internal and external, so you can get some information about how many projects a person is part of, and whether they're a viewer or a collaborator.
14:12
And you can also see what universities are linked to projects within your repository. So if you do have researchers using projects, it can be a very useful way to see, you know, external collaborations.
14:32
A few last things: there's reporting on collections, which are a way to package items together under one DOI. Then we get to statistical events, so views and downloads by items, and then views for collections and projects.
14:48
Then finally, the Altmetric score for that account is included in the report. It's at the end, so maybe some folks miss it, but it is there.
14:58
I'm just going to scroll across to see Altmetric on this report.
15:03
Altmetric
15:04
always says that the number itself is, you know, a high-level, overall indication of attention from the internet to a digital object. You really need to look at the actual information and data from Altmetric to figure out what the impact might be. But having that in this report means that you can sort accounts by the ones who are maybe attracting the most attention, and then you can dig deeper. So, that's nice to have in there as well.
15:35
So, the statistics in general come from the Figshare API.
15:46
So that's this next section. I'm not going to go into a lot of detail here. But the API documentation is at docs.figshare.com.
15:55
I've already navigated to the stats section on the website here on the left, and so it's switched.
16:05
There's lots of information about these endpoints, but specifically there's an endpoint to see the geolocation, so you can get a list of where views are coming from.
16:18
There is a timeline, so you can see views over time,
16:21
for example. The tops endpoint allows you to see top items by views, shares, or downloads, as well as top authors, collections, groups, and projects.
16:36
Totals.
16:38
You can see the total views, downloads, or shares by these different entities.
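As a rough illustration of how a script might address the totals and timeline endpoints just described, here is a minimal Python sketch that only builds request URLs and does not call anything. The base URL, the path layout, and the helper names `total_url` and `timeline_url` are assumptions made for this example, not something stated in the webinar; check docs.figshare.com for the actual paths before relying on them.

```python
from urllib.parse import quote

# Assumed base URL for the stats service; confirm against docs.figshare.com.
STATS_BASE = "https://stats.figshare.com"


def _join(parts):
    """Join already-quoted path segments onto the assumed base URL."""
    return "/".join([STATS_BASE] + parts)


def total_url(counter, item_type, item_id, institution=None):
    """Build a URL for a 'totals' request (path layout is an assumption).

    counter:   e.g. "views", "downloads", or "shares"
    item_type: e.g. "article", "author", "collection", "group", "project"
    """
    parts = [quote(institution, safe="")] if institution else []
    parts += ["total", quote(counter, safe=""), quote(item_type, safe=""), str(item_id)]
    return _join(parts)


def timeline_url(granularity, counter, item_type, item_id, institution=None):
    """Build a URL for a 'timeline' request (path layout is an assumption)."""
    parts = [quote(institution, safe="")] if institution else []
    parts += [
        "timeline",
        quote(granularity, safe=""),
        quote(counter, safe=""),
        quote(item_type, safe=""),
        str(item_id),
    ]
    return _join(parts)


# Hypothetical item ID and institution name, for illustration only.
print(total_url("views", "article", 123))
print(timeline_url("month", "downloads", "article", 123, institution="myuni"))
```

Keeping URL construction separate from the network call makes it easy to check the paths against the documentation before sending any requests.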
16:44
Then there's also an endpoint to count articles. So, perhaps this is useful if you want to create some custom reporting; this is one way you can do that. And we have a help page, How to use the Figshare API.
17:00
It's a work in progress, but I do have a very simple script that I'll add to this page.
17:06
There'll be a link to it, and this is kind of what it will look like.
17:10
If you're not an experienced programmer, this is a way just to get started. It's a way to call some of those endpoints.
17:21
If you're querying figshare.com, you don't need a token.
17:25
But if you're working with your own repository, you'll need to create a token for an account, and then you can start calling those endpoints with a script so that'll be available on this page.
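To make the token point concrete, here is a small Python sketch, not the script mentioned in the webinar, that prepares one request without a token and one with a token. It only constructs the request objects and sends nothing. The base URL, the two example paths, the `token` header scheme, and the placeholder token value are assumptions to verify against docs.figshare.com.

```python
import urllib.request

# Assumed API base URL; confirm against docs.figshare.com.
API_BASE = "https://api.figshare.com/v2"


def build_request(path, token=None):
    """Prepare (but do not send) a GET request to the API.

    If a personal token is supplied, it is passed in the Authorization
    header; the "token <value>" scheme is an assumption to verify.
    """
    headers = {"Accept": "application/json"}
    if token:
        headers["Authorization"] = f"token {token}"
    url = f"{API_BASE}/{path.lstrip('/')}"
    return urllib.request.Request(url, headers=headers)


# Public data: no token needed.
public_req = build_request("articles")

# Your own repository or account: a token created for an account is needed.
# "YOUR_TOKEN_HERE" is a placeholder, not a real token.
private_req = build_request("account/articles", token="YOUR_TOKEN_HERE")

print(public_req.full_url, public_req.get_header("Authorization"))
print(private_req.full_url, private_req.get_header("Authorization"))
```

To actually send a request, pass the object to `urllib.request.urlopen` and parse the response body with `json.loads`.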
17:35
Just to show you what's possible with the Figshare API, the University of Sheffield has done a lot of work creating custom interfaces for their users, for their repository.
17:44
This is an example of a user profile page. And they put all the statistics in here.
17:53
So, there's information about views and downloads, cites and shares, and they've added some interesting charts here.
18:01
Downloads and views, and then the size represents the number of items by type, so that's, that's really interesting to see.
18:09
Next, a pie chart and a table of information.
18:12
Um.
18:13
And then all the items themselves, so this is what's possible if you have developer resources. The API endpoints allow you to do this kind of thing.
18:27
So um, we're actually getting close to the end here.
18:31
I wanted to end today talking about our new batch management tool and how that can be used as a source of information for reporting.
18:44
So I'm going to go back to Team Figshare here.
18:49
And if you have an administrator login, you will have this batch management link.
18:56
We're going to do a webinar on this at some point, probably this autumn, to talk about how everything works here. I'm going to focus on just the download section here.
19:07
But you can use this to upload metadata and files in batch.
19:13
But you can also use this just to download all the metadata from your repository.
19:18
You can look at just private metadata or public metadata, all the metadata, or metadata from a specific group.
19:27
So, in this case, we can download all the public metadata. Maybe this is for individual reporting, or something.
19:36
This will e-mail you that file, to whatever e-mail is associated with your account. I've already done this and have an example up here.
19:49
This is what the download looks like.
19:52
It includes a subset of the metadata that is available with every item or collection. So, this isn't everything, but it includes all the fields that you'd actively edit or add to.
20:11
And there are some things that could be very useful for reporting. Some things are already in the stats dashboard, you know, like information about item types, that's probably already there, but you have it here. Funding information is included here. Funding can be added as free text, or linked to Dimensions records.
20:30
In this case, this was a free text entry. It looks like it's not linked to a Dimensions ID.
20:38
But some of these are. In this case, this has a Dimensions ID, a grant ID from Dimensions. So, you can see how these are formatted in JSON. Now, you could break all this out using, you know, the text-to-columns features in Excel or Google Sheets, and Find and Replace, and all that,
20:57
to get, say, all the grant titles in one column to report on. I'm going to show you how to do that using a script as well. Some other information in this file might be useful.
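As one possible alternative to text-to-columns and Find and Replace, a few lines of Python can pull the grant titles out of a JSON-formatted funding cell. This is a sketch, not the script from the webinar: the function name `grant_titles`, the `title` and `grant_id` keys, and the sample cell contents are all hypothetical, so check the actual JSON in your own batch download first.

```python
import json


def grant_titles(cell):
    """Return the grant titles found in one funding cell.

    Assumes the cell holds either a JSON list of objects with a "title"
    key (hypothetical field name) or plain free text.
    """
    if not cell or not cell.strip():
        return []
    try:
        entries = json.loads(cell)
    except json.JSONDecodeError:
        # Not JSON: treat the whole cell as a single free-text entry.
        return [cell.strip()]
    if isinstance(entries, dict):
        entries = [entries]
    if not isinstance(entries, list):
        return [cell.strip()]
    return [e.get("title", "") for e in entries if isinstance(e, dict)]


# Made-up example cell, for illustration only.
sample_cell = '[{"title": "Grant A", "grant_id": 1}, {"title": "Grant B"}]'
print(grant_titles(sample_cell))
print(grant_titles("Free text grant"))
```

Applying this function to every row of the funding column gives one list of titles per item, which can then be flattened into a single column for reporting.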
21:08
You can report on whether things have references and, you know, how connected things are, and report on the licenses that are used in the repository.
21:19
There's lots of information related to the embargo state of items, and then I want to mention one last thing: the resource title and DOI are here as well.
21:28
This is the link to a published paper related to the item in the repository. It could be useful to report on how many items in your repository are linked to published papers.
21:44
So, as I mentioned, you have this spreadsheet. If you're comfortable working in spreadsheets, then you can use pivot tables or whatever you need to summarize this.
21:54
I did make a script that I will, once again, make available on the How to use the Figshare API help page.
22:02
That takes this information and puts it into a slightly easier format and also gathers a little extra information that might be useful.
22:10
I really see this just as a starting point.
22:12
If you are interested in, you know, customizing your reporting, you might be able to use this as a starting point.
22:22
I'm not going to go into detail on this, but basically it opens up that batch download file,
22:28
converts it into a JSON format,
22:32
collects views and downloads for every item, and goes in and collects the funder name from the Figshare API as well. So if it's linked to a Dimensions grant record,
22:47
you can pull in the specific funder name to report on that.
22:52
That funder name is not included as part of the metadata in this funding column. It also does some summaries for you. In this case, it's pulling out some more of those JSON-formatted entries in the spreadsheet.
23:11
We can see the licenses and how many items are using each license type. We get a little summary of how many files or items are embargoed, metadata only, or linked files. And then here's the report
23:29
of just the number and percentage of items that are linked to published papers, so possibly useful for someone out there. And then for funder name, we can see the funder and the number of items it's part of.
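The license and linked-publication summaries described here could be reproduced along these lines. This is a minimal sketch with made-up rows, not the script shown in the webinar: the `license` and `resource_doi` keys are hypothetical stand-ins for the corresponding columns in the batch download.

```python
from collections import Counter

# Made-up rows standing in for the batch download; the keys are assumptions.
rows = [
    {"license": "CC BY 4.0", "resource_doi": "10.1000/example.1"},
    {"license": "CC BY 4.0", "resource_doi": ""},
    {"license": "CC0", "resource_doi": "10.1000/example.2"},
    {"license": "MIT", "resource_doi": ""},
]


def license_counts(rows):
    """Count how many items use each license type."""
    return Counter(r["license"] for r in rows)


def linked_summary(rows):
    """Return (number, percentage) of items with a non-empty resource DOI."""
    linked = sum(1 for r in rows if r.get("resource_doi", "").strip())
    pct = 100 * linked / len(rows) if rows else 0.0
    return linked, pct


print(license_counts(rows))
print(linked_summary(rows))
```

With a real download, the rows could be read with `csv.DictReader` after exporting the spreadsheet to CSV.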
23:45
Then it will also graph it. It's just trying to provide lots of different options. This can be changed to, you know, graph or chart different fields.
23:54
Standardizing the views by the number of items actually changes the chart. I should mention this is views on the Y axis and the funder name on the X axis.
24:07
We can see that the Directorate for Computer and Information Science and Engineering, they only have three items, but they have quite a few views per item.
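Standardizing views by the number of items is a simple division per funder. The sketch below shows one way to compute it; the `funder` and `views` keys and the sample numbers are invented for illustration and are not figures from the webinar.

```python
from collections import defaultdict


def views_per_item_by_funder(items):
    """Return {funder: total views / number of items} for the given items."""
    totals = defaultdict(lambda: [0, 0])  # funder -> [total views, item count]
    for item in items:
        entry = totals[item["funder"]]
        entry[0] += item["views"]
        entry[1] += 1
    return {funder: views / count for funder, (views, count) in totals.items()}


# Invented sample data: funder "B" has one item, but many views per item.
sample_items = [
    {"funder": "A", "views": 10},
    {"funder": "A", "views": 30},
    {"funder": "B", "views": 90},
]
print(views_per_item_by_funder(sample_items))
```

A funder with only a few items can rank high on this measure, which is why the standardized chart looks different from the chart of raw view counts.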
24:17
Then, finally, you can do this by grant name.
24:20
And, this last chart here is just the top 10 categories.
24:26
In the repository, this is the count of the number of items using that category term.
24:32
So, uh, just some suggestions of possible ways to use that batch download to add to your reporting.
24:42
So, I've kind of skimmed across the top of reporting here and given a high-level view, but I hope that it's at least giving you an idea of what's available, and maybe giving you an idea of things that you haven't thought about reporting on yet.
25:05
I'm always available to answer questions. If I don't know the answer, of course, I can track down the answer for you.
25:12
Once again, if you do have questions specific to your stats dashboards, the best thing to do is send a support ticket to support at ... dot com.
25:22
And you can also ask me, as well.
25:24
So I think I'm going to end there, and let's see.
25:28
I hope people have been able to hear me.
25:30
Looks like there's a question.
25:33
It looks like my sound was breaking up. I'm sorry about that.
25:41
Hopefully, you can all hear me OK.
25:43
Are there any other questions? Are there any questions?
26:00
No questions coming in right now.
26:07
This recording will be available
26:11
on a new website or a page that we're setting up, so that'll be available to you if you want to watch it again. And I will make those API scripts available on that API help page, if those are useful for anyone.
26:27
It doesn't look like any questions have come in.
26:30
So, check the chat.
26:33
So I think we'll end it here. I want to thank everyone for attending today. I really appreciate it, and I hope to see you at future webinars.
26:46
Thanks again.