stefanaalten
Board Regular
- Joined
- Feb 1, 2011
- Messages
- 71
- Office Version
- 365
- Platform
- Windows
I'm still looking for some help with what at first sight looks like some simple data manipulation but it has me stumped! I have data on books and authors, like this:
Author1 Book1
Author1 Book2
Author1 Book3
Author2 Book1
Author2 Book3
Author2 Book4
Author3 Book5
Author3 Book6
Author4 Book7
Author5 Book7
Authors may have produced more than one book and each book may have multiple authors.
I need to turn those cases where any two books are "related" (having same author) into rows like this:
Book1 Book2 Author1
Book1 Book3 Author1
Book2 Book3 Author1
Book1 Book3 Author2
Book1 Book4 Author2
Book3 Book4 Author2
Book5 Book6 Author3
Note that in the example Author4 and Author5 and Book7 do not feature in the second list: Author4 and Author5 have not worked on any other book(s), and Book7 is not related to another book produced by the same author(s).
I don't have masses of data yet, but as the list grows it will become impractical to do this completely manually. I would really to know how this can be automated, whether via a general workflow (e.g. involving copying-pasting), or a fully-automated formula-driven solution.
I will then use this data to construct a graph using Gephi, with the nodes representing books, and the edges (connecting the nodes) representing the book-pairs.
Can anyone help please? Many thanks in advance!
P.S. I had posted a similar question on this previously but it was not clearly stated, so hope it is alright to re-ask (?)
Author1 Book1
Author1 Book2
Author1 Book3
Author2 Book1
Author2 Book3
Author2 Book4
Author3 Book5
Author3 Book6
Author4 Book7
Author5 Book7
Authors may have produced more than one book and each book may have multiple authors.
I need to turn those cases where any two books are "related" (having same author) into rows like this:
Book1 Book2 Author1
Book1 Book3 Author1
Book2 Book3 Author1
Book1 Book3 Author2
Book1 Book4 Author2
Book3 Book4 Author2
Book5 Book6 Author3
Note that in the example Author4 and Author5 and Book7 do not feature in the second list: Author4 and Author5 have not worked on any other book(s), and Book7 is not related to another book produced by the same author(s).
I don't have masses of data yet, but as the list grows it will become impractical to do this completely manually. I would really to know how this can be automated, whether via a general workflow (e.g. involving copying-pasting), or a fully-automated formula-driven solution.
I will then use this data to construct a graph using Gephi, with the nodes representing books, and the edges (connecting the nodes) representing the book-pairs.
Can anyone help please? Many thanks in advance!
P.S. I had posted a similar question on this previously but it was not clearly stated, so hope it is alright to re-ask (?)