You are on page 1of 5

How to Implement a Technology Supported Free-Selection Peer Review Protocol

Design Implications from Two Studies on Computer Network Education

Pantelis M. Papadopoulos Thomas D. Lagkas Stavros N. Demetriadis


Informatics Department ICTE Department Informatics Department
Aristotle University of Thessaloniki University of Western University Aristotle University of Thessaloniki
Thessaloniki, Greece Kozani, Greece Thessaloniki, Greece
pmpapad@csd.auth.gr tlagkas@uowm.gr sdemetri@csd.auth.gr

Abstract—This paper presents design guidelines for CS discipline [e.g., 10, 11, 12], with promising results
implementing a free-selection peer review protocol. “Free- reported.
selection” (FS) refers to the ability of students freely access all Researchers stress the fact that peer review offers to
available peer work and choose which of them to read and students the chance of developing a range of skills important
review. A series of two studies on the free-selection protocol in the development of language and writing ability, such as
has provided evidence on the efficiency of the method. In the meaningful interaction with peers, new perspectives on the
First study, the FS protocol was compared against the widely writing process, and a greater exposure to ideas [13, 14].
used assigned-pair one, where students work in instructor- Based on that, we focused our attention on how to enhance
defined dyads. In the Second study, further issues of the FS
the peer review method by increasing even further students’
approach were evaluated, with our attention focused on
students who, due to the freedom element of the protocol, do
exposure on new perspectives and ideas.
not receive reviews. Both studies paint a very promising As a reference point we used the widely used “assigned-
picture of free-selection. However, several issues were also pair” (AP) protocol. While there are studies that assign
raised on how to effectively apply such a protocol. As the use of multiple reviewers to a single peer [15, 16], in the context of
technology is necessary in the FS approach, we provide in this our study we use the term “assigned-pair” to refer to a more
paper the design implications derived from the two studies common research design where students in a author-reviewer
regarding, (a) peer work availability, (b) non-reviewed works, dyad are assigned exclusively to each other. Students can
(c) student population, (d) peer work visual representation, (e) play both roles and review each other’s work. The overhead
peer work length, (f) presentation order, (g) double-blinded for the instructor is well contained and the activity is
approach, (h) peer reviews availability, and (i) students’ straightforward for the students. This design, of course, is not
approaches in selecting peer work for reading and reviewing. without drawbacks. One review for a single peer might not
be enough for the author to get valuable comments and
CSCL, peer review, free-selection, assigned pair, technology- suggestions for improvement. Additionally, the benefit for
enhanced learning, computer network education the reviewer may be limited, since she gets just one more
point of view (the author’s). Finally, the method requires a
I. INTRODUCTION level of stability in the dyads formation throughout the
Peer review is an instructional method aiming to help activity, while a bad pairing may have negative results.
students elaborate on domain-specific knowledge, while Our focus was to (a) enhance the learning benefits of
simultaneously developing methodological review skills. peer review for the students, without increasing the
Typically, in a peer review cycle, an author drafts a piece of minimum amount of work that they had to do, and (b) keep
work which is then evaluated by a peer. The evaluation or instructor’s overhead low. Towards this direction, we
critique is carried out anonymously on the basis of explicitly decided to investigate the efficiency of a “free-selection”
defined criteria and is subsequently returned to the author. (FS) protocol, where there are no dyads and students are free
The author is free to review his or her final draft based on the to browse all peer work and select what to review.
given critique. Yet, when practicing peer review in the We tested the effectiveness of the FS protocol in a series
classroom the instructor has a number of alternative design of two studies conducted in the same context. Results were
selections to choose from. very promising suggesting that indeed such a protocol can
The literature abounds with relevant studies indicating enhance the learning outcome. However, the implementation
that the method is popular among educators inspired mainly of the FS protocol requires the use of technology and is more
by the constructivist and socio-constructivist paradigms for complex from its AP counterpart. Through the two studies,
learning [e.g., 1, 2, 3] who want to challenge their students to we were able to identify factors that can affect the impact of
think critically, synthesize information, and communicate the FS approach.
science in nontechnical language. The method has been used In the following, we present (a) information on the two
extensively in various fields [e.g., 4, 5, 6, 7, 8, 9] and in the studies on the FS protocol, (b) useful design implications for
implementing the protocol, and (c) conclusions on the
approach.

This paper is available by IEEE Xplore at:


http://dx.doi.org/10.1109/ICALT.2012.12
II. FIRST STUDY review). To do so, we randomly selected 15 students. These
The First study [17] compared the effectiveness of the students participated as anybody else in the activity, but their
Free-Selection protocol against the typical Assigned-Pair answers did not appear to others and thus they did not
one, where students work in instructor-defined dyads received reviews. This means that all 38 students submitted
(playing both the roles of author-reviewer). reviews, but the answers of only 23 students were available
The study employed 37 sophomore students majoring in for review. We informed students beforehand of this random
Informatics and Telecommunication Engineering in a 5-year selection, although we did not divulge the identities of the
study program. We randomly assigned students into two selected ones. Five students whose answers appeared to
study conditions: Free-Selection (FS) (n=17), and Assigned- others, but did not receive reviews to all their answers were
Pair (AP) (n=20). Pre-test results analysis showed that the also added to the initial 15. The students that did not receive
two groups were comparable in terms of prior domain reviews had to conduct a self-review process, following
knowledge (p>0.05). instructions on analyzing their answers. Eventually, 20
The activity lasted two weeks. During the first week, all students worked in the Self-Review (SR) group, and 18
students studied individually online in a learning students in the Peer-Review (PR) group. Pre-test results
environment that we developed. Students had to study cases analysis showed that the two groups were comparable in
presenting past experienced in the domain of Network terms of prior domain knowledge (p>0.05).
Planning and Design (NP&D) and provide answers to three The activity followed the same pattern as before. During
realistic scenarios about various companies in need of a new the first week, students had to study individually in the
computer network. The first week was identical for the two environment and provide answers to the three scenarios on
groups. During the second week, students in the AP group the NP&D domain. During the second week, all students
worked in same-gender dyads that we randomly formatted. followed the Free-Selection review protocol, freely
Students in a dyad had to blindly review each other answers reviewing at least one answer per scenario. At the end,
and provide comments and suggestions for improvement. students in the PR group revised their initial answers based
Based on these suggestions, students could revise their initial on the comments they received, while students in the SR
answer and complete their study in the learning environment. group based their revisions only on what they read in others’
On the other hand, students in the FS group were able to see answers and in the self-review form. Post-test result analysis
the answers of all the other students and freely select which on domain knowledge acquisition showed that the two
answer to review (they had to review at least one answer per groups were comparable (p>0.05).
scenario). Because of the free format in selecting what to IV. DESIGN IMPLICATIONS
review, a small number of answers did not receive a review
and we assigned these answers for review directly to selected Based on the findings of these two studies, we propose a
students. set of design implications for the designers and instructors
Post-test result analysis on domain knowledge who wish to implement the free-selection approach.
acquisition showed that students in the FS group performed A. Make Students’ Work Freely Available to Support
significantly better than those in the AP group in acquisition Multiple Viewpoints
of domain knowledge (p<0.05). Also, FS students became
significantly better reviewers, by resembling the scoring The main idea behind the Free-Selection review protocol
patterns of expert raters employed in the study. is that students benefit more when they are allowed to read
The First study gave us clear indications of the learning as many of their peers’ submissions as they want. By reading
benefits of the Free-Selection approach in peer review multiple peer submissions on the same subject, students get
settings. However, it also highlighted a number of research to know others’ opinions and perspectives. That way, they
questions issues that needed further analysis. The most become aware of alternative approaches and argumentation.
important of them was the impact of the FS approach to Ideally, a student in a free-selection setting should be able to
students that do not receive reviews for the answers they recognize solid arguments, distinct between high and low
submit. quality works, and understand were the high quality works
converge. The degree that this process is actually happening
III. SECOND STUDY depends on the ability of each student to analyze and
compare various viewpoints.
Following the findings of the First study, the Second
Peer review protocols that strictly define and narrow
study [18] analyzed the effectiveness of the FS protocol on
down the amount of peer work that a student has access to
students that do not receive review due to the free policy on
diminish the impact multiple perspectives can have on
choosing answers to review.
learning. In addition, in these protocols, access to peer work
The Second study was conducted a year after the First
also means an obligation to produce a review. Thus, the
one, on the exact same context, meaning the students’ year of
obvious argument in favor of these protocols is that they are
study, knowledge domain, and material. The study employed
easy to implement. Students have a contained amount of
38 sophomore students majoring in Informatics and
work to perform, while the overhead for the instructor is
Telecommunication Engineering. In order to compare the
easily controlled. On the other hand, the free-selection
impact of not receiving a review, we needed similar
approach requires the use of technology in managing and
populations of students in the two conditions (review/no
controlling the activity. However, this technology is readily
available. Although we developed our own technology- analyze their initial answers and compare them to what they
enhanced learning environment for the research needs of the have read in other answers. Results analysis showed that
two studies, an interested researcher could apply the method students that did not receive reviews achieved the same level
using technological tools such as forums, and LMSs. of performance as the students that did. This is a clear
The important thing to note about the Free-Selection indication that the lack of reviews can easily be handled in
protocol is that by granting access to all submissions, we do the FS protocol.
not oppose additional workload to the students. We just give A designer could apply various strategies to ensure that
them the option to get better informed about what others did, each submission will receive a review in FS (e.g., by
before conducting a review. In the First study, the students applying a first-come-first-served method on the availability
took advantage of this and read on average 8 (out of the 16 of the submissions). However, this affects the degree of
available) answers before selecting which ones to review. freedom the students have in selecting submissions to read
They reviewed on average 2 answers in each scenario. This and review. Based on the findings of the Second study, we
was their choice and as they said in the interviews, the fact believe that such strategies are not necessary.
that they were free to choose the number of answers to read
and review (they had to submit at least on review) made C. Apply to Medium Size Groups
them develop a positive attitude towards the activity. On the The size of the group in which the FS protocol is
other hand, there were also students that chose to apply a implemented can affect the learning outcome. Although the
minimum effort strategy and read and review only one goal is to present multiple peer works to the students, we
answer in each scenario. These students essentially shifted need to provide students with an amount of data that they can
from the FS setting to the AP one. In other words, even in handle. In the First study, there were 17 students in the FS
the extreme cases were students opt to read and review only group, meaning that each student had access to 16 answers in
one peer work, the effectiveness of the FS method would be each scenario. Usage data showed that students read on
at the same level as the typical AP approach. average 8 answers per scenario. The higher the number of
available answers, the higher the probability that students
B. Apply Self-Review Instead of an At-Least-One-Review- will select randomly which ones to read. That is why in the
Per-Submission Strategy Second study, we decided to show a random subset of 9
The most important critique against the Free-Selection answers to each student. Each answer appeared the same
protocol is that the method is based on the randomness of the number of times and had equally chances of receiving review
students’ selection process, thus allowing the existence of comments. An interested designer, should either divide large
non-reviewed submissions. Applying a review protocol that groups of students into smaller ones and implement the FS
does not guarantee at least one review per submission may independently in each one, or include all students in one
seem counter-intuitive. On the contrary, though, we maintain group and randomly provide access to a number of answers
that the effectiveness of the FS approach relies more on that would not overwhelm the students.
conducting the review process itself and not on receiving
D. Provide an Effective View of All the Submissions
review comments.
A student is benefited by a review process in three ways: Students working in a Free-Selection setting have to
(a) by reading peer work, the student becomes aware of other make two choices. First, they have to select which answers to
points of view, (b) the process of analyzing and critiquing read, and second, which of these answers to review. The
peer work engages student deeper with the learning material, explanation behind the second choice is probably clearer for
since this is the background on which review comments and us. The student has a picture of what others said and, based
suggestions should be rooted, and (c) the student receives on a personal criterion, makes a selection. As we will later
review comments and suggestion to improve the initial explain, this criterion (or set of criteria) can be different for
answer. Items (a) and (b) can be significantly enhanced in a each student, but eventually explains student’s behavior in
free-selection setting. As we already mentioned, in FS, the activity. What is not that clear is how the student selects
students have much more access to others’ views that in any which of the available answers to read. This selection is very
other review protocol. The results from the two studies important, because it defines student’s expose to others’
showed that students submit more reviews when in FS (First: opinions and automatically excludes from reviewing the rest
M=1.9; Second: M=1.7, per scenario for FS, M=1.0 for AP of the answers.
in First). This means that student engagement is also We propose two design guidelines to assist students in
increased. Regarding item (c), there is indeed an this selection. First, the designer must provide a visual
unpredictable number of students that may not receive representation of the set of available answers in a way that it
comments. This, of course, happens because the rest of the will be easy for the students to get an idea of the available
students receive multiple reviews (First: max=3; Second: material. For this, we chose to present the peers’ answers in a
max=6, in a single scenario). Thus, for some students, even grid, rather than in list, to avoid scrolling and make it easier
item (c) is enhanced in FS. for the students to print this grid. We believe that a list gives
The question is whether the lack of review comments has a sense of order, sequence, or importance. While using a
a significant effect on students. This was exactly the focus of search engine or reading the news online, the student, as any
the Second study. We addressed the lack of review other user, is used (or even encouraged) to pay more
comments with a self-review form where the students had to attention to the top of the list. The use of a grid was our
Figure 1. Answer grid. According to this figure, the student has read answers 1, 2, 3, 5, and 9 (marked by an eye
icon), and has reviewed answers 1 and 3 (marked by a bullet icon). The complete grid had 16 answers for the Free-
Selection group.

approach to circumvent this bias. Other representations may focus only on answers appearing in the first places. As a
be used, having in mind to present all submissions as equally good measure we applied a simple algorithm to randomly
important. The students had to click on an answer in the grid present the answers. We used students’ username in the
to view the whole answer text. system as the algorithm seed so that each student would view
The second design guideline is to give students the a different answer grid that would remain the same each time
chance to get a rough idea of what each answers says, the student logins in the system.
without reading the whole submission. In our case, the Usage analysis showed that both the readings and the
knowledge domain was Network Planning and Design, and reviews were widely spread throughout the grid and this was
students included characteristics of the network designs they one of the reasons we had so few non-reviewed answers in
proposed even in the first sentences of their answers. Thus, the First study.
including only the first 3-4 lines of each answer in the grid
boxes was usually enough to give students a glimpse of what G. Double-Blinded Review to Move Focus on Submission
their peers were suggesting (Fig. 1). Content
In both our studies, students’ answers were usually a We applied a double-blinded review process in both our
little less that a whole page. In settings were students studies. Our goal was to analyze students’ choice in selecting
produce lengthier deliverables, additional information should which answers to read and review when they had to base
be given to the students. For example, in case the students selection solely on the answer text. This allowed us to avoid
have to submit papers expanding in several pages, abstracts any biases. In the First study, we asked students whether they
should be required so that they can be presented in the grid. wanted to know the identity of the authors they reviewed and
The goal in FS is to present shortly the main points of each of their reviewers. Students answered using a 5-step Likert
answer without having the students reading the whole text. scale (1=No, 5=Yes). Students in the Assigned-Pair group
That way, they will be able to take an informed decision on were divided regarding the authors’ identities (M=2.50), but
which answers to read and which to skip. most of them wanted to know who commented on their
answers (M=3.43). On the contrary, students in the FS group
E. Preferred for short deliverables did not care about others’ identity (M=1.36, M=1.55,
It was very important that the students’ answers were respectively). Thus, one could choose to apply a non-blinded
relative short, rarely filling a whole page. This was approach. We believe, however, that students’ behavior
commented by the students in the interviews, saying that it could change in case peers’ identities are freely available.
was easy to read a lot of answers because of their length.
One should expect that as the answers become longer, the H. Control Access of Submitted Reviews
possibility of students reading more of them gets lower. This, In our studies, we chose to present the review comments
in turn, diminishes one of the main purposes of the FS only to the respective authors, thus controlling students’
approach, which is to make the students learn more by interaction. In that way we were able to better control the
analyzing different opinions. variables under investigation in each study. However, one
could opt to make the review comments public, thus
F. Random Appearance of Submissions allowing deeper interaction between the students. This could
One guideline that is connected to the previous section is affect the impact of the FS approach in several ways.
to randomly present the answer into the answer grid to First, students are better informed of each other’s
enhance the spread of reviews, even if the students decide to perspectives, since each review represents in a sense the
opinion of the reviewer on the subject. This can be beneficial REFERENCES
for the students. [1] Topping, K. (1998). Peer assessment between students in colleges and
On the other hand, available reviews may affect future universities. Review of Educational Research, 68, 249-276.
reviewers’ judgment. The first review may act as the [2] Falchikov, N. (2001). Learning together: Peer tutoring in higher
dominant one, while it is not likely that students would education. London: Routledge Falmer.
choose to submit conflicting reviews. It is more probable that [3] Liu, C. C., & Tsai, C. M. (2005). Peer assessment through web-based
students will submit converging reviews, and in that sense knowledge acquisition: tools to support conceptual awareness.
reproduce same comments and suggestions. Innovations in Education and Teaching International, 42, 43-59.
Finally, having the reviews publicly available diminishes [4] Falchikov, N., & Goldfinch, J. (2000). Student Peer Assessment in
Higher Education: A Meta-Analysis Comparing Peer and Teacher
the chance of non-reviewed answers. It will be clear which Marks. Review of Educational Research, 70(3), 287-322.
answers have already been reviewed and which ones are left [5] Dossin, M. M. (2003). Among Friends: Effective Peer Critiquing. The
non-reviewed. Thus, students will be able to provide one Clearing House, 76 (4), 206-207.
review to each answer. [6] Anewalt, K. (2005). Using peer review as a vehicle for
communication skill development and active learning. Journal of
I. Factors that Affect Students’ Selection on Selecting a Computing in Small Colleges, 21(2), 148-155.
Submission for Review [7] Turner, S., Pérez-Quiñones, M.A., Edwards, S., & Chase, J (2010).
Base on usage data and interview statements in the two Peer Review in CS2: Conceptual Learning. Proceedings of
studies, there are two main criteria students apply in SIGCSE’10, March 10–13, 2010, Milwaukee, Wisconsin, USA.
selecting which answers to review. Half of the students try to [8] Liou, H. C., & Peng, Z. Y. (2009). Training effects on computer-
mediated peer review. System, 37, 514–525
avoid conflict and select answers that, according to them, are
of high quality. This way the students are able to provide [9] Goldin, I. M., & Ashley, K. D., (2011). Peering Inside Peer Review
with Bayesian Models. In G. Biswas et al. (Eds.): AIED 2011, LNAI
complementing reviews saying only nice things for their 6738, pp. 90–97. Springer-Verlag: Berlin.
fellow students. On the contrary, another half of the students [10] Crespo, R. M., Pardo, A., & Kloos, C. D. (2004). An Adaptive
selects answers that, according to them, present obvious need Strategy for Peer Review. Paper presented at ASEE/IEEE Frontiers in
for improvement. This way it is easier for the students to Education Conference. Savannah, GA.
write more meaningful reviews. Of course, students’ [11] Ziu, E.Z.F., Lin, S., Chiu, C.H., & Yuan, S.M. (2001). Web-based
opinions about high and low quality answers are not always peer review: the learner as both adapter and reviewer. IEEE
in line with the instructor’s judgment. However, these two Transactions on Education, 44, 246-251.
ways of thinking are, most of the times, behind students’ [12] Demetriadis, S., Egerter, T., Hanisch, F., & Fischer, F. (2011). Peer
review-based scripted collaboration to support domain-specific and
behavior. Other factors that, according to students, affect the domain-general knowledge acquisition in computer science.
selection process are the length of an answer (medium size Computer Science Education, 21(1), 29-56.
answers are usually preferred), and how clear an answer is [13] Hansen, J., & Liu, J. (2005). Guiding principles for effective peer
(students in general refrain from reviewing answers that they response. ELT Journal, 59, 31-38.
do not understand). [14] Lundstrom, K., & Baker, W. (2009). To give is better than to receive:
The benefits of peer review to the reviewer’s own writing. Journal of
V. CONCLUSIONS Second Language Writing, 18, 30-43.
Results on the Free-Selection approach have been [15] Tseng, S., & Tsai, C-C. (2007). On-line peer assessment and the role
of peer feedback: A study of high school computer course. Computer
encouraging so far, and as the availability of the necessary & Education, 49, 1161–1174.
technological tools makes the implementation easy, we [16] Tsai, C.-C., & Liang, J.-C. (2009). The development of science
expect similar approach to gain researchers’ attention. In the activities via on-line peer assessment: The role of scientific
two studies, we tested a generic FS format. However, as we epistemological views. Instructional Science, 37, 293-310.
presented here, there is a number of factors that can be [17] XXXXXX (2012). Computers & Education.
modified and affect the learning outcome. The 9 implications [18] XXXXXX (in preparation).
presented in the previous sections can provide a useful
picture to the instructor who wishes to implement similar
peer review methods.

You might also like