First post here, so apologies for anything I miss out.
I work in an academic library and I'm trying to profile the collection by year published. The only problem is that the dates for academic journals (which span several years) were entered manually and so appear in a seemingly endless variety of formats.
Below is a sample of the kinds of dates I'm dealing with.
While this is obviously a mess, the reason I retain some hope is that almost all of the 4-digit numbers are years (the odd issue number, such as 8896 in the last two rows, is the exception), so if I could somehow extract these it would give me a fighting chance.
Start date Summary Description
N1-VOL.5, 1949-1953. N1-VOL.5, 1949-1953.
1996 VOL.1 NO.1 (1996)-Vol. 8 No. 1 (2003) VOL.1 NO.1 (1996)-
1955-1977. 1955-1977.
2010 VOL.398 (2010)- Vol. 412, No. 8896 (2014 Jul. 19)
2010 VOL.398 (2010)- Vol. 412, No. 8897 (2014 Jul. 26)
In summary: does anyone know a query to extract 4-digit numbers from a string of mixed text and numbers? If not, does anyone have any idea how else I could solve this?
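To make the request concrete, here is a rough Python sketch of the kind of extraction I have in mind. It is only an illustration, not a query for any particular database. The names YEAR_RE, extract_years and year_span are made up for this example. It assumes every year falls between 1800 and 2099, which is my own guess at a sensible range and is there so that 4-digit issue numbers such as 8896 are skipped.

```python
import re

# Assumption: a "year" is a standalone 4-digit number from 1800 to 2099.
# The lookarounds stop the pattern matching inside longer digit runs.
YEAR_RE = re.compile(r"(?<!\d)(?:1[89]|20)\d{2}(?!\d)")


def extract_years(text):
    """Return every plausible year found in text, in order of appearance."""
    return [int(match) for match in YEAR_RE.findall(text)]


def year_span(text):
    """Return (earliest, latest) year in text, or None if no year is found."""
    years = extract_years(text)
    if not years:
        return None
    return min(years), max(years)


if __name__ == "__main__":
    # Sample strings taken from the rows posted above.
    samples = [
        "N1-VOL.5, 1949-1953.",
        "VOL.1 NO.1 (1996)-Vol. 8 No. 1 (2003)",
        "1955-1977.",
        "VOL.398 (2010)- Vol. 412, No. 8896 (2014 Jul. 19)",
    ]
    for sample in samples:
        print(sample, "->", extract_years(sample), year_span(sample))
```

Something along these lines would give me a start and end year per record, which is all I need for the profile. Most database regex functions accept a similar pattern, though the exact syntax varies.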
Many thanks,
Michael