New content added in past few weeks

Here’s an overview of the new content added in the past few weeks. Two collections are of particular note: the Lloyd’s List for 1812, via 1812Privateers.org, and the Dyal Ship Collection. One man, Michael Dun, has digitized and indexed all of the issues of Lloyd’s List for the entire year of 1812. It’s quite a feat. He’s indexed all of the ships and all of the masters for that time, adding up to nearly 26,000 ship citations in all the issues of Lloyd’s List for 1812. He kindly shared his index with me, so I could include links to his resources. Mr. Dun hosts the pages on his servers, and they are accessible to all via that site. While working through the index of ship names that he provided to me, I was able to identify a number of corrections, and I incorporated those into the file I imported.

Working through this file was also an interesting reminder about the challenges we face in trying to make the most of these primary sources. Clearly, the folks who were putting together each issue of Lloyd’s List (it usually came out twice a week, and was published in London) were trying to get information out as quickly as possible, and weren’t too concerned with absolute accuracy, to say nothing of how researchers two centuries later would like them to present information.

As a few examples, the following slight spelling variations by the editors likely all refer to the same ship: Misletoe, Misseltoe, and Missletoe (there’s no Mistletoe listed in this year of Lloyd’s!). Or, Nymph, Nymphe, and Nymphen. Or Powhatan, Powahattan, and Powhatton. Or Zenophon and Zenophen, when the proper spelling is Xenophon. Or Tinmouth Castle, most likely meaning Teignmouth Castle. Or simple errors, like Hepsa instead of Hespa.

Of course, if you’re reading this at a London coffee shop one morning in 1812, you can easily overlook these minor errors, and figure out what the editors’ intent was. But for researchers two centuries later, who are trying to mine large amounts of data to see what they can find, these errors cause a problem. So how do we address them? That’s an issue for an upcoming blog post. But, needless to say, we at ShipIndex.org have a solution…
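To give a feel for why this is hard for a machine, here is a rough sketch in Python of one common way to cluster near-identical spellings. To be clear, this is not the ShipIndex.org solution, just an illustration. The function names and the 0.8 similarity threshold are my own assumptions, and it only uses the standard library’s difflib.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Return a 0..1 similarity score for two names, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def group_variants(names, threshold=0.8):
    """Greedily group names that resemble the first name of an existing group.

    The threshold is an illustrative guess, not a tuned value.
    """
    groups = []
    for name in names:
        for group in groups:
            if similarity(name, group[0]) >= threshold:
                group.append(name)
                break
        else:
            # No existing group was close enough, so start a new one.
            groups.append([name])
    return groups


names = [
    "Misletoe", "Misseltoe", "Missletoe",
    "Nymph", "Nymphe", "Nymphen",
    "Powhatan", "Powahattan", "Powhatton",
]
for group in group_variants(names):
    print(group)
```

With the nine names above, this should come back as three groups, one per ship. The catch is that a similarity score can’t tell a misspelling from a genuinely different vessel with a similar name, so anything like this still needs a human to review the suggested matches.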

Another interesting addition is the Dyal Ship Collection, but for very different reasons. This is a collection of images and data compiled by a researcher (in this case, a librarian) and added to his institution’s “institutional repository” (IR). An IR is a site, usually maintained by an academic library, where content generated by the institution’s faculty, staff, and students is made available for free. It is, in a large sense, a reaction to the high cost of many academic journals, where an institution’s researchers spend time and money doing and compiling research, then pay to have that published in a scholarly journal, then the institution pays to buy the results back, through a subscription to the journal. The whole discussion is beyond the scope of this blog post, but the point is that IRs are places where interesting and useful information can be stored — but it’s most often quite hidden, unless there’s some effective way of indexing the content.

So, with the encouragement and assistance of the compiler, we’ve created links into the collection of files and images that are stored in Texas Tech University’s institutional repository. Recently, we’ve heard from others who have data they’d like us to include, and we’re looking at ways of doing that effectively. This is just one example of that.

Other items we’ve added are mostly more standard print or online collections. The total list is as follows:

If you have maritime content that you’d like to get online, or is online but needs broader publicity, please let us know. We’d love to find a way to help.

Full text links from within ShipIndex

ShipIndex.org links to the full-text for nearly 85% of its citations! Before Mike ran the numbers, I guessed that a conservative estimate on links to full text would be at about 70%, so the 85% number was quite a surprise, but it’s true.

How did we do this? First, we’re linking to lots and lots of content online. There are so many free online resources with information about ships out there, and I feel like I find another one every week. But other than ShipIndex, there’s nowhere that brings all these resources together, and no way to search all of them at once. With ShipIndex, that’s exactly what you’re doing. But that alone doesn’t get one to 85%.

Recently, we started looking for resources in Google Books. The next time you’re searching in ShipIndex and you see a hotlinked page number, try clicking on that page number. It should take you right to the page of the book within Google’s Book Search project.

Here are two examples from freely-available resources:

  • The citation for Aroostook, from Paul Calore’s Naval Campaigns of the Civil War, has a link to page 128, and the vessel is mentioned near the start of the last paragraph.
  • The citation for City of Pekin, from Arthur Clark’s The Clipper Ship Era, has a link to page 86, and the ship is mentioned about 2/3 of the way down the page.

This was an interesting experience, and I learned a lot when we did it. The goal was to try and link directly to the page that cited a specific ship. I discovered four different levels of Google Books linking:

  • No content: The book just can’t be found, or it’s cited but offers no view into it at all.
  • Snippet view: With snippet view, you really do get just a touch of the book, and it’s hard to know how much or what you’ll get. Most importantly, you can only search by terms; you can’t ask Google to show you all of a specific page.
  • Preview: With preview, Google offers most of the pages of a book. This is common for recently-published works, and Google works with the publisher to figure out what they’ll show. The idea, obviously, is to show enough that someone wants to go out and buy the full book.
  • Full view: For these books, Google shows the entire thing. These are primarily books that are out of copyright protection – so, published before 1923.

We only activate links for books that are available via Full View and Preview — and we only do the Preview if it appears that most links will get to the page in question. We’ve found a few titles that are available in Preview, but so many links go to pages that aren’t visible, perhaps because the publisher only allows 10-20% of the book to be shown via Google Books, that it seems misleading to offer those links.

Links to Snippet views don’t work because there’s no way to get to a specific page. You could try to search for the ship name, but if the ship name is something like “Elizabeth”, then you’ll get every mention of “Elizabeth” in the book – including names of people, not just ships. Also, the searches just don’t work as well. This could be a result of problems in the OCR work: if the OCR isn’t very good, then Google won’t find specific phrases. With page linking, by contrast, we’re going to a specific page, not searching for a ship name in the book’s text.

So, as a result, you’ll most likely find linking to Google for very old books (via Full View) and very new books (via Preview).
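The rule of thumb described above can be sketched in a few lines of Python. This is only an illustration of the decision, not our actual code: the ViewLevel names, the should_activate_links function, and the 50% cutoff standing in for “most links” are all assumptions on my part, and it doesn’t call any Google API.

```python
from enum import Enum


class ViewLevel(Enum):
    """The four levels of Google Books access described in this post."""
    NO_CONTENT = "no content"
    SNIPPET = "snippet view"
    PREVIEW = "preview"
    FULL = "full view"


def should_activate_links(level, cited_pages, visible_pages, min_share=0.5):
    """Decide whether page links for a book are worth turning on.

    cited_pages: page numbers our citations point to.
    visible_pages: page numbers known to be viewable (hypothetical input).
    min_share: assumed cutoff for "most links reach the page".
    """
    if level is ViewLevel.FULL:
        return True
    if level is ViewLevel.PREVIEW:
        if not cited_pages:
            return False
        reachable = sum(1 for page in cited_pages if page in visible_pages)
        return reachable / len(cited_pages) > min_share
    # Snippet view and no-content books can't be linked by page at all.
    return False


print(should_activate_links(ViewLevel.FULL, [128], set()))
print(should_activate_links(ViewLevel.PREVIEW, [10, 20, 30, 40], {10, 20, 30}))
print(should_activate_links(ViewLevel.SNIPPET, [86], {86}))
```

In practice, knowing which pages of a Preview title are visible is the hard part; here it’s simply passed in as a set.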

The horror stories about metadata in Google Books are very true. It’s a mess for any slightly complicated title, such as multi-volume sets. So, finding Navy Records Society volumes — especially multi-volume works that weren’t published consecutively — was sometimes quite a challenge. And, in some cases, volumes that should be available just aren’t. I found one book that was completely upside down. Others have lousy scan quality. But the fact is that an enormous amount of content is available from anyone’s computer now, and it will only improve.

Try it out; see what you think.

From the 9th Maritime Heritage Conference, Baltimore

I’m writing from the 9th Maritime Heritage Conference, in Baltimore, right now. The Maritime Heritage Conference takes place every three years, and I’ve had the opportunity to attend a few conferences in the past. It’s neat to reconnect with friends in the maritime history community, and find out what’s been happening in the field.

Given the subject, we’ve had some great conference receptions on board ships, and I must admit I’ve failed to make the most of seeing these ships. I certainly attended, and wandered around a bit, but (so far) I haven’t explored the vessels as much as I should have. On Wednesday evening, when I arrived, we had a reception on board the Liberty Ship John W. Brown. The folks running the Brown have done a great job in putting together a walking tour of an incredible amount of the very large ship. The Brown also nicely represents a specific time – 1944, when it’s getting ready to travel on a convoy across the North Atlantic. The folks working and volunteering on board the Brown have had a lot of history with these ships, and some attendees told me about talking with the volunteers, some of whom began working on these ships when they were operating in convoys, or soon after the War.

Last night’s reception was on board USS Constellation, and again I enjoyed it, but didn’t take advantage of going through all levels of the ship. However, I understand today that I can board any time during the conference, so I hope to get a chance to go again.

Tomorrow morning, there’s a tour of NS Savannah, the first nuclear merchant ship, which is moored in Baltimore while its future is being decided. I hope I’ll be able to participate, though the tour is quite long and I am also giving a talk about ShipIndex.org tomorrow afternoon and need to be sure I’m fully ready to give this presentation.

Tomorrow evening, we’re scheduled to have a reception on board USCG Barque Eagle, which was due to arrive in Baltimore today. It may have done so; I haven’t looked out yet to see if there’s a new set of masts in the Inner Harbor. I feel certain we won’t be able to go below on board Eagle, so I should feel OK about just standing on the deck tomorrow evening!

New feature: tracking ship updates

Here’s the third blog post for the morning. It’s definitely the most exciting. We’ve just released the mostest coolestest feature of ShipIndex since starting the site. (OK, so that’s admittedly my personal opinion, but I think it’s also a fact.)

Effective immediately, anyone with an account (that is, anyone who has created a username – you don’t need to be a subscriber) can be notified whenever a ship page is updated with new information. So, if you’re particularly interested in a vessel named Unanimity, you can go to that page, click on the button near the top that reads “NOTIFY ME when this page is updated”, and then whenever new content is added, you’ll get an email telling you so!

If you’re a subscriber, you’ll see what resource the content is from. You can go to the page directly, and check out the new citation.

If you’re not a subscriber, you’ll be notified that new citations have been added. You may decide it’s finally time to gain access to everything that’s available on the site. Or, perhaps you use ShipIndex.org through a subscription provided by your local public or academic library. Go to your ship’s page and locate the new citations, which are always marked by a “new” icon for 45 days from the addition of the resource.

You’ll get just one email containing updates for all the ships you’re tracking, not a separate email for each ship, or each citation. Emails are sent in batches, several times per week, reflecting all the data added since the last update.
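For the curious, here is a minimal Python sketch of how that kind of batching could work: one digest per user, covering every tracked ship that gained citations since the last run. It is an illustration only, not the code running on the site. The build_digests function, the shape of its inputs, and the example addresses are all hypothetical.

```python
from collections import defaultdict


def build_digests(tracking, new_citations, subscribers=frozenset()):
    """Build one digest message per user who has something new to read.

    tracking: {user_email: set of ship names the user follows}
    new_citations: list of (ship_name, resource_title) added since last run
    subscribers: users who get to see which resource the citation is from
    """
    by_ship = defaultdict(list)
    for ship, resource in new_citations:
        by_ship[ship].append(resource)

    digests = {}
    for user, ships in tracking.items():
        lines = []
        for ship in sorted(ships):
            resources = by_ship.get(ship)
            if not resources:
                continue  # nothing new for this ship
            if user in subscribers:
                lines.append(f"{ship}: new citations from {', '.join(resources)}")
            else:
                lines.append(f"{ship}: {len(resources)} new citation(s)")
        if lines:
            # A single message per user, however many ships were updated.
            digests[user] = "\n".join(lines)
    return digests


tracking = {
    "reader@example.org": {"Unanimity", "Wasp"},
    "visitor@example.org": {"Unanimity"},
    "quiet@example.org": {"Ranger"},
}
new_citations = [("Unanimity", "Lloyd's List 1812"), ("Wasp", "DANFS")]
digests = build_digests(tracking, new_citations, subscribers={"reader@example.org"})
for user, body in digests.items():
    print(user)
    print(body)
```

Users whose ships had no new citations simply get no email for that batch.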

When you’re done following a vessel, you can just go to the ship page, click on the button that reads “CANCEL NOTIFICATIONS for this ship”, and the emails will stop.

You need to be logged in, so that we can keep track of how to notify you when a page is updated. But, as mentioned above, you DON’T need to be a subscriber. Also, from your profile page, you can see all the vessels you’re tracking, and clear all your notifications, or go to each page and modify them individually.

I truly believe this is an enormous step forward in what we’re offering via ShipIndex.org. You no longer need to come to the site to check on updates regarding the ships that interest you; we’ll take care of that for you. Now, when new citations are added for the ships that matter to you, you’ll be the first to know.

Please try it out, and let us know what you think. Remember: you do need to have an account, but you don’t need to be a subscriber.

I hope you’re as excited about this as I am.

Most commonly used US Navy vessel names

I was doing a bit of data cleanup today, and found some moderately interesting items. I was looking at the Dictionary of American Naval Fighting Ships, and correcting the way we represented some ship names – specifically those that were used multiple times by the US Navy. In looking over the information we have about US Naval vessel names, I found that there were about 1451 names that were used at least twice; 470 used at least three times; 182 used at least four times; 83 used at least five times; and 30 used at least six times.

Boston, Shark, and all those that follow have each been used seven times; Enterprise, Hornet, Morris, Niagara, and Washington each top out at eight uses. Wasp has been used nine times, and Ranger has been used ten times.

These numbers don’t include ships whose names already contained a number when they entered service, such as Lexington II; Lexington II entered the Navy with that name and kept it, while the five naval vessels named Lexington all carried the same name, Lexington.

These numbers are most likely pretty close to accurate, though if you spot an anomaly among them, please let me (and other readers) know. I analyzed the names of the vessels listed in DANFS to come up with the numbers, so it’s limited to the vessels included in the current DANFS online at the navy.mil site.
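If you’d like to run the same sort of tally on your own list of names, the counting itself is simple. Here’s a small Python sketch, assuming you have one entry per vessel; the names_used_at_least function and the toy list are mine, and the toy list is nowhere near the full DANFS data.

```python
from collections import Counter


def names_used_at_least(vessel_names, thresholds):
    """For each threshold n, count how many distinct names were used n+ times.

    vessel_names: one entry per vessel, so a reused name appears repeatedly.
    """
    uses = Counter(vessel_names)
    return {n: sum(1 for count in uses.values() if count >= n) for n in thresholds}


# Toy sample built from a few figures in this post, not the real dataset.
# "Lexington II" is treated as its own name, separate from "Lexington".
sample = (
    ["Ranger"] * 10
    + ["Wasp"] * 9
    + ["Enterprise"] * 8
    + ["Boston"] * 7
    + ["Lexington"] * 5
    + ["Lexington II"]
)
print(names_used_at_least(sample, [2, 7, 8, 9, 10]))
```

Note that the counts are cumulative: a name used ten times is also counted among those used at least twice, which is how the figures above are stated.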

Navy Records Society volumes and other new content

The following content was added in the last few days. We’ve added the content of indexes from nearly a dozen additional Navy Records Society volumes, as well as several other monographs covering a wide range of time periods and geographic regions.

Stay tuned for several additional updates.

ShipIndex is taking on crew!

Hoo-boy. Big Day here at ShipIndex.org’s Eastern US World Headquarters.

We’ve decided that it’s time to find the right person to help us with institutional sales. To that end, we are putting out this job announcement and are looking for someone to join our team. If you’re that person, or know someone who might be, please let them and us know.  Please help us by sharing this information widely.

In a nutshell, this is a position for a person who knows libraries, and knows library sales. This is a work-from-home position, and we don’t necessarily expect a full-time commitment, though because of the graduated commission structure, it might be worth it. (We can talk about salaries and commission further down the line, maybe not right here on the blog.) The job doesn’t require a lot of travel, except for the usual big library conferences.

The posting is below; please let us know if you have questions, or would like to be considered for the position. We hope to make a decision, and get moving on this, as quickly as we can.

Manager, Institutional Sales, ShipIndex.org

ShipIndex.org seeks a part-time or full-time person to lead and manage all aspects of the company’s institutional sales. The successful applicant will have a documented history of successful institutional sales management; a demonstrated ability to work independently as a self-starter; and an understanding of libraries and how they use and manage electronic resources.

ShipIndex.org helps people do research on specific ships, boats, and vessels. We have a database of over 1.3 million citations – and growing – that tells people what books, journals, websites, and databases mention the vessel they’re researching. We offer our service directly to consumers and also to institutions. ShipIndex.org is a valuable tool for public, academic, and special libraries, primarily in supporting genealogy and history, but with additional application in many other fields. The successful applicant’s responsibility will be all institutional sales, in the US and abroad, with support as needed from the rest of the company. Physical location is not an issue, though the individual must be able to work in the US legally.

Compensation is primarily commission-based, with a part-time salary component. While we expect a minimum of 20 hours per week invested in the work, most of the compensation is in a sliding-scale commission structure, so there is a clear benefit to a greater time investment. This is a telephone sales position, so minimal travel is expected, with the exception of occasional conferences, such as ALA Annual, ALA Midwinter, PLA, ACRL, and others, as appropriate. The successful candidate will participate in decisions regarding which conferences s/he attends.

Responsibilities include following up on leads generated online and at conferences, generating new leads, explaining the product and its benefits to potential customers, managing consortial sales and promotion, advising the company on marketing and sales strategies and tools, helping customers through the invoicing and licensing process, providing limited support as needed and with significant assistance from the rest of the company, and other duties as necessary in guiding institutional sales.

At present, ShipIndex.org consists of two owners, who live on opposite sides of the country. The successful candidate will be the company’s first employee; applicants must be certain they’re comfortable working in this size of a company.

If you’re interested in applying, please submit a work history and a cover letter explaining your interest in the position and the library industry. An interest in maritime history is also helpful, but not required. Please include the names of at least three references. All applications will be held in strictest confidence.

We welcome questions about the position. Questions and applications may be submitted to careers [at] shipindex [dot] org.

ShipIndex as bag sponsor at 9th Maritime Heritage Conference

I’m excited to report that ShipIndex is a Bronze level sponsor for the upcoming Maritime Heritage Conference in Baltimore, this coming September. We’ll also be sponsoring the conference bags, which is particularly cool. This is the first sponsorship we’ve undertaken, and we hope that it will go well.

A lot of what we need to do right now is get our name out there, so that the appropriate people learn what we’re doing, what our benefits are, and why their institutions should subscribe. (Of course, we also offer individual subscriptions, which are certainly a good thing, too — but they’re not appropriate for institutions, for a variety of reasons.)

So, getting our name (and our very cool logo) in front of several hundred maritime historians should be a very good thing. I’m going to attend, and I’ll spend my time talking with folks, too, about what we offer. We’ll have to see what comes from the event, and decide if it’s worth doing at other conferences in the future. It costs money, obviously, and that’s in reasonably short supply at the moment, but I think that, in the end, it’ll be worth doing. We’ll just have to wait and see, I guess.

A friend told me I should have put together a presentation about ShipIndex.org, and he pointed out all sorts of great stuff I could have done — talking about how we’re actually doing it, what problems we’re facing, what the implications are for unique vessel identifiers (especially for ships of a previous era, before IMO numbers and other modern identifiers), how developing identifiers for non-extant vessels could benefit researchers, and more. I wish I’d thought of it in time to submit a proposal, but I didn’t. Alas. But I think it’s actually a very interesting story, and I think that there’s quite a lot one can learn just from analyzing and discussing this very big database we’ve built (and continue to add to), so I hope I’ll find a good opportunity to talk about this some time in the not-too-distant future. If you think of a spot, please let me know.

And, of course, if you’ll be attending the conference, or if you’ll be in Baltimore during it, and you’d like to talk about ShipIndex.org, please tell me. It’s nearly my favorite subject, so I’m always happy to talk about it.

New functionality: Citation counts

Mike has built a nice new piece of the website that tells you how many citations you’ll find for each entry, and what type of resource you’ll find them in.

If you’re accessing the freely-accessible content, and don’t have a subscription, you’ll see how many citations are in the free database, and how many are in the complete database (i.e., both the free and the premium databases). Each listing also shows what types of resources are listed. For example, if you’re using the free content, and you search for “Columbus”, you’ll see a message that reads:

The free database contains 112 citations from 40 resources, including 37 books, 2 journals, and 1 online resource, with 1 illustration.

The complete database contains 574 citations from 71 resources, including 51 books, 8 journals, and 12 online resources, with 3 illustrations and 24 passenger or crew lists.

Note that we also indicate how many illustrations and passenger or crew lists you’ll find in each part of the database. This gives you a better feel for what to expect, if you’re trying to decide whether or not you should subscribe.

If you’re searching the premium database, you’ll see an entry like the following:

This ship has 574 citations from 71 resources, including 51 books, 8 journals, and 12 online resources, with 3 illustrations and 24 passenger or crew lists.

Of course, these numbers will change as we add more content.
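As a rough illustration of how a message like the ones above can be assembled from the underlying counts, here’s a short Python sketch. It isn’t Mike’s code. The function names are made up, and it assumes the resource total is simply books plus journals plus online resources, which holds for the Columbus figures quoted here.

```python
def count_phrase(n, singular):
    """Return e.g. '1 journal' or '8 journals'."""
    return f"{n} {singular if n == 1 else singular + 's'}"


def join_list(parts):
    """Join phrases as 'a', 'a and b', or 'a, b, and c'."""
    if len(parts) <= 2:
        return " and ".join(parts)
    return ", ".join(parts[:-1]) + ", and " + parts[-1]


def summarize(prefix, citations, books, journals, online,
              illustrations=0, crew_lists=0):
    """Build the one-sentence citation summary shown on a ship page."""
    resources = books + journals + online  # assumed to be the resource total
    kinds = [
        count_phrase(n, word)
        for n, word in ((books, "book"), (journals, "journal"),
                        (online, "online resource"))
        if n
    ]
    extras = [
        count_phrase(n, word)
        for n, word in ((illustrations, "illustration"),
                        (crew_lists, "passenger or crew list"))
        if n
    ]
    text = f"{prefix} {count_phrase(citations, 'citation')} from {count_phrase(resources, 'resource')}"
    if kinds:
        text += ", including " + join_list(kinds)
    if extras:
        text += ", with " + join_list(extras)
    return text + "."


print(summarize("The free database contains", 112, 37, 2, 1, illustrations=1))
print(summarize("This ship has", 574, 51, 8, 12, illustrations=3, crew_lists=24))
```

The two calls reproduce the free-database and premium messages for Columbus quoted above, including the singular “1 online resource” and “1 illustration”.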

We hope this will be especially useful for folks who are trying to decide if they should subscribe or not, but it should also be quite valuable for subscribers who want to be sure they’re seeing everything there is to see about their vessel.

Enjoy.

New clients!

We’ve added several new clients in the past few months, but I’ve forgotten to mention them. These include:

  • National Maritime Museum (Greenwich, England)
  • UCLA
  • US Merchant Marine Academy
  • San Francisco Maritime National Historical Park
  • Peabody Essex Museum (Salem, MA)

Also, we have a number of institutions currently running trials, including:

  • Family History Library (Salt Lake City)
  • Library of Congress
  • US Naval Academy
  • Library of Virginia
  • La Crosse (WI) Public Library
  • Siuslaw (OR) Public Library

If you’re associated with any of these institutions, you should be able to access the complete contents of the site, without any problems at all. If you’re not associated with any of these institutions, you can always ask your local librarians to investigate a subscription to ShipIndex.org, and ask them to set up a free trial.